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METHODS AND MATERIALS FOR CHARACTERIZING AND 
MODULATING INTERACTION BETWEEN HEREGULIN AND HER3 

RELATED APPLICATIONS 
This application claims priority under Section 119(e) from U.S. Provisional 
Application Serial No. 60/308,431 filed July 27, 2001, the contents of which are 
incorporated herein by reference. 
5 FIELD OF THE INVENTION 

This invention relates generally to HER3 compositions, and to methods of using 
HER3 compositions to modulate the interaction between heregulin and HER3 and for 
example, the biological activities of heregulin and HER3. The invention also relates to 
methods of identifying molecules that bind to the heregulin binding site of HER3. The 
10 invention also relates to methods for in vitro, in situ, and/or in vivo diagnosis and/or 
treatment of mammalian cells or pathological conditions associated with HER3 and 
heregulin. 

BACKGROUND OF THE INVENTION 
15 Cancers are the second most prevalent cause of death in the United States, 

causing 450,000 deaths per year. One in three Americans will develop cancer, and one in 
five will die of cancer. While substantial progress has been made in identifying some of 
the likely environmental and hereditary causes of cancer, there is a need for substantial 
improvement in the diagnosis and therapy for cancer and related diseases and disorders. 
20 A number of so-called cancer genes, i.e., genes that have been implicated in the 

etiology of cancer, have been identified in connection with hereditary forms of cancer 
and in a large number of well-studied tumor cells. Cancer genes are broadly classified 
into "oncogenes" which, when activated, promote tumorigenesis, and "tumor suppressor 
genes" which, when damaged, fail to suppress tumorigenesis. While these classifications 
25 provide a useful method for conceptualizing tumorigenesis, it is also possible that a 
particular gene may play differing roles depending upon the particular allelic form of that 
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gene, its regulatory elements, the genetic background and the tissue environment in 
which it is operating. Human epidermal growth factor receptor 3 is one of these genes 
implicated in the etiology of cancer (see, e.g. Munster et aL, Cancer Res. 2002 Jun 
l;62(ll):3132-7; Menard et aL J Cell Physiol 2000 Feb;l 82(2): 150-62; Basso et aL, 
5 Oncogene 2002 Feb 14;21(8):1 159-66; and Yarden Oncology 2001 ;61 Suppl 2:1-13). 

Human epidermal growth factor receptor 3 (HER3) (see, e.g. Kraus et aL, (1989) 
Proc. Natl. Acad. Sci. U. S. A. 86, 9193-9197) is a member of the type I receptor tyrosine 
kinase (RTK) family, which also includes EGFR, HER2/neu, and HER4 (see, e.g. Ullrich 
et aL, (1984) Nature 309, 418-425; Schechter et aL, (1985) Science 229, 976-978; 

10 Plowman et aL,. (1993) Proc. Nad. Acad. ScL U. S. A. 90, 1746-1750). HER3 forms 
heterodimers with other members of the type I RTK family, including the HER2/mu 
receptor (see, e.g. Sliwkowski et aL, (1994) J. BioL Chem. 269, 14661-14665; Fitzpatrick 
et aL, (1998) FEBS Lett. 431, 102-106; Heldin, G H. (1995) Cell 80, 213-223; Tzahar et 
aL, (1997) EMBO J. 16, 938-4950). The HER2/neu receptor is amplified and 

15 overexpressed in 25-30% of human breast and 8-11% off human ovarian cancers. This 
overexpression correlates with increased morbidity and mortality, and there is evidence 
that the overexpressed HER2 receptor leads to aggressive malignancies (see, e.g. Slamon 
et aL, (1987) Science 235, 177-182; Slamon et aL, (1989) Science 244, 707-712; Plowman 
et aL, (1993) Nature 366, 473-475; Dougall et aL, (1996) DNA Cell BioL 15, 31-40). 

20 Cells expressing only HER2 receptors alone and not other members of the 

EGFR family fail to bmd heregulin, but HER2/neu has high tyrosine kinase activity. 
HER3 is a kinase defective receptor (see, e.g. Guy et aL (1994) Proc Natl Acad Sci USA 
91(17), 8132-6), but has binding affinity for heregulin (see, e.g. Carraway et al. (1994) / 
Biol Chem 269(19), 14303-6). The HER2/HER3 heterodimer forms a high affinity 

25 heregulin receptor with tyrosine kinase activity. Heregulin binding to cells that display 
the HER2/HER3 heterodimer causes a mitogenic response both in vittv and in vivo, so 
understanding this interaction is of medical importance (see, e.g. Aguilar et aL, (1999) 
Oncogene 18, 6050-6062; Sliwkowski et aL (1994) J Biol Chem 269(20), 14661-5; Heldin, 
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C H. (1995) Cell 80(2), 213-23; Tzahar et al. (1997) Embo J 16(16), 4938-50), Alternate 
transcripts of HER3 isolated from an ovarian carcinoma-derived cell line have been 
identified which encode truncated forms of the extracellular domain of HER3, including 
three clones where the protein products were soluble secreted proteins (see, e.g. Lee, H., 
5 and Maihle, N. J. (1998) Oncogene 16(25), 3243-52). A naturally occurring secreted form 
of HER3 has been found to inhibit heregulin-stimulated activation of HER3 (see, e.g. 
Lee et al. (2001) Cancer Res 61(11), 4467-73). This provides evidence thatHER3 could be 
an important target in breast cancer therapy. 

Type I receptor tyrosine kinases typically contain four extracellular domains, a 

10 single hydrophobic transmembrane segment, and a cytoplasmic tyrosine kinase domain 
(see, e.g. Ullrich, A., and Schlessinger, J. (1990) Cell 61, 203-212). HER2/Wis a very 
active tyrosine kinase, but cells expressing HER2/W alone, and not other members of 
the EGFR family, fail to bind heregulin. Conversely, the HER3 receptor binds heregulin 
but has low tyrosine kinase activity (see, e.g. Guy et aL, (1994) Proc. Nad. Acad. Sci. U. S. 

15 A. 91, 8132-8136; Carraway et al., (1994) J. Biol. Chem. 269, 14303-14306). As noted 
above, the HER2/HER3 heterodimer is a high affinity heregulin binding complex with 
signaling activity through the HER-2 kinase domain. To date, the domains of HER3 
involved in ligand binding and heterodimerrzation have not been identified. 

Thus far, the high carbohydrate content (see, e.g. Horan et al., (1995) J. Biol. 

20 Chem. 270, 24604-24608) and the relatively large size (-180 kDa) of the receptors in the 
EGFR family have frustrated structural analysis by x-ray crystallography and NMR, so 
other methods have been sought to illuminate the structure and function of HER3. The 
extracellular domains (ECDs) of the type I RTKs have been divided into four domains: 
I, II, III, and IV, based on sequence analysis (see, e.g. Yarden, Y., and Ullrich, A. (1988) 

25 Annu. Rev. Biochem. 57, 443^78). Domains II and IV are cysteine-rich and are similar 
in sequence. Domains I and HI also have sequence similarity (see, e.g. Yarden, Y, and 
Ullrich, A. (1988) Annu. Rev. Biochem. 57, 443-478; Lax et aL, (1988) Mol. Cell. BioL 8, 
1970-1978). little is known about the specific function of each domain except in EGFR, 
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where several lines of evidence provide evidence that the major determinants for EGF 
binding lie in domain 111. These lines of evidence include the following: 1) the exchange 
of domain III in chicken EGFR for domain 111 from human EGFR confers binding of 
human EGF (see, e.g. Lax et aL, (1989) EMBO J. 8, 421-427; Lax et aL, (1991) Cell 
5 Regul. 2, 337-345); 2) monoclonal antibodies that recognize residues in domain III 
prevent EGF binding to EGFR (see, e.g. Wu et aL, (1989) J. Biol. Chem. 264, 17469- 
17475); 3) cross-linking of EGF to EGFR identified residues in domain HI (see, e.g. 
Summerfield et aL, (1996) J. BioL Chem. 271, 19656-19659; Wu et aL, (1990) Proc. Nad 
Acad Sci. U. S. A. 87, 3151-3155); and 4) limited proteolysis of the ECD of EGFR 

10 produced a fragment that encompassed domain III, which bound transforming growth 
factor, with the observation that binding could be enhanced by including portions of 
domain IV (see, e.g. Kohda et aL, (1993) J. BioL Chem. 268, 1976-1981). Additional 
studies from cross-linking experiments indicated that bound EGF is also close to 
tyrosine 101 in domain I of murine EGFR (see, e.g. Woltjer et aL, (1992) Proc. Nad 

15 Acad. Sci. U. S. A. 89, 7801-7805). Taken together, these experimental results provide 
evidence that domain I and III are close to the ligand-binding region in EGFR and that 
domain III contributes most of the binding. HER3 and EGFR have relatively high 
sequence identity (45% identity in the ECD) and belong to the same family of type 1 
tyrosine kinase receptors; however, they bind a different subset of ligands and differ in 

20 preference for heterodimerization versus homodimerization (see, e.g. Tzahar et aL, (1997) 
EMBO J. 16, 938-4950; Lemmon et aL, (1997) EMBO J. 16, 281-294; Huang et aL, 
(1998) Biochem. J. 331, 113-119; Alimandi et aL, (1995) Oncogene 10, 1813-1821). 

While the existing art provides a Limited understanding of the structure of HER3 
and the interaction between heregulin and HER3, this art does not delineate the domains 

25 in HER3 responsible for interacting with heregulin. Consequently there is a need in the 
art for the identification and characterization of the domains in HER3 involved in this 
interaction so that methods and materials for modulating this interaction can be 
generated. The disclosure provided herein meets this need. 
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SUMMARY OF THE INVENTION 
The disclosure provided herein identifies and characterizes the domains in the 
HER3 receptor that interact with the heregulin ligand. In this context, the present 
5 invention provides antagonists or agonists of the HER3 receptor such as soluble HER3 
variants and methods for using them. The antagonists and agonists described herein find 
utility for, among other things, in vitro, in situ, or in vivo diagnosis or treatment of 
mammalian cells or pathological conditions associated with the aberrant expression of 
heregulin and/ or HER3. 

10 A preferred embodiment of the invention is a HER3 variant polypeptide 

comprising an amino acid sequence which differs from the native sequence HER3 
polypeptide sequence of SEQ ID NO: 2 and has one or more of amino acid 
substitutions at the following residue positions) in SEQ ED NO: 2: E43; N44; K51; E64; 
V66; and V110. Another embodiment of the invention is a non-naturally occurring 

15 HER3 variant polypeptide consisting essentially of amino acid 20 to amino acid 329 of 
HER3 polypeptide sequence of SEQ ID NO: 2; wherein the HER3 variant polypeptide 
specifically binds to the heregulin polypeptide of SEQ ID NO: 4 and exhibits an 
impaired ability to interact with the HER2 polypeptide of SEQ ID NO: 6. 

Related embodiments of this invention include an isolated nucleic acid 

20 comprising a nucleotide sequence encoding these HER3 variants, vectors comprising 
nucleic acids encoding these variants and host cells (e.g. E. co/i) containing these vectors. 
Another related embodiment of the invention is a method of making HER3 variant 
polypeptide by providing a host cell comprising a vector encoding a HER3 variant, 
providing culture media, cidturing the host cell in the culture media under conditions 

25 sufficient to express the HER3 variant polypeptide, recovering the HER3 variant 
polypeptide from the host cell or culture media, and then purifying the HER3 variant 
polypeptide. Optionally the HER3 variant polypeptide is conjugated or linked to one or 
more polyol groups. 
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The invention also provides methods of conducting screening assays to identify 
candidate molecules, such as small molecule compounds, polypeptides or antibodies, 
which act as agonists or antagonists with respect to the interaction between heregulin 
and HER3. One embodiment of the invention is a method of identifying a compound 
5 which specifically binds the heregulin binding domain in a HER3 variant polypeptide 
comprising amino acid 20 to amino acid 329 of HER3 polypeptide sequence of SEQ ID 
NO: 2 comprising contacting the HER3 variant polypeptide with a test compound under 
conditions favorable to binding and determining whether the test compound specifically 
binds to the HER3 variant polypeptide such that a compound which binds to the HER3 

10 variant polypeptide can be identified. Optionally this method can further comprise 
determining whether the test compound inhibits the heregulin induced tyrosine kinase 
activity by contacting mammalian cells that express HER3 receptor with heregulin in the 
presence and absence of the test compound and then monitoring the mammalian cells 
for the tyrosine kinase activity associated with the HER2/HER3 polypeptide complex, 

15 wherein an inhibition in tyrosine kinase activity in the presence of the test compound as 
compared to the absence of the test compound identifies the test compound as an 
inhibitor of heregulin induced tyrosine kinase activity. Alternatively, the method can 
further comprise deterrruning whether the test compound enhances the heregulin 
induced tyrosine kinase activity by contacting mammalian cells that express HER3 

20 receptor with heregulin in the presence and absence of the test compound and then 
monitoring the mammalian cells for the tyrosine kinase activity associated with the 
HER3 polypeptide, wherein an increase in tyrosine kinase activity in the presence of the 
test compound as compared to the absence of the test compound identifies the test 
compound as an enhancer of heregulin induced tyrosine kinase activity. 

25 Yet another embodiment of the invention is a method of determining whether a 

test compound modulates the interaction between heregulin polypeptide of SEQ ID 
NO: 2 and a HER3 variant polypeptide comprising amino acid 20 to amino acid 329 of 
SEQ ID NO: 2 by contacting the HER3 variant polypeptide with a test compound under 
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conditions favorable to binding, contacting the HER3 variant polypeptide with heregulin 
under conditions favorable to binding and then comparing the binding interaction 
between the HER3 variant polypeptide and heregulin with the binding interaction 
between HER3 variant polypeptide and heregulin in the absence of the test compound 
5 such that a compound which modulates the interaction between heregulin and the HER3 
variant polypeptide can be identified. 

Another embodiment of the invention is a method of inhibiting the interaction 
between a heregulin polypeptide having the sequence shown in SEQ ID NO: 4 and 
HER3 polypeptide having the sequence shown in SEQ ID NO: 2 comprising exposing 

10 the heregulin polypeptide to a non-naturally occurring HER3 variant polypeptide 
comprising domains I and II of the HER3 polypeptide sequence of SEQ ID NO: 2; 
wherein the HER3 variant polypeptide specifically binds to the heregulin polypeptide of 
SEQ ID NO: 4; and exhibits an impaired ability to interact with the HER2 polypeptide 
of SEQ ID NO: 6. In preferred embodiments of the invention, the HER3 variant 

15 polypeptide has an amino acid substitution VI 10 in SEQ ID NO: 2. 

Preferred embodiments of the invention also ^include anti-HER3 antibodies 
which are capable of specifically binding to the heregulin binding domain. In a preferred 
embodiment, the invention provides antibodies which specifically bind to a HER3 
polypeptide and inhibit the binding of heregulin ligand to the HER3 receptor. 

20 Optionally, the antibodies are monoclonal antibodies. 

In one embodiment of the invention, there are provided methods for the use of 
heregulin antagonists to block or neutralize the interaction between heregulin and HER3. 
• For example, the invention provides a method comprising exposing a mammalian cell, 
such as an ovarian or breast cell, to one or more heregulin antagonists in an amount 

25 effective to decrease, neutralize or block activity of the heregulin ligand. The cell may be 
in cell culture or in a mammal, e.g. a mammal suffering from, for instance, cancer. Thus, 
the invention includes a method for treating a mammal suffering from a pathological 
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condition such as cancer comprising administering an effective amount of one or more 
heregulin antagonists, as disclosed herein. 

In addition, the invention provides methods of using HER3 agonists (e.g. anti- 
HER3 agonist antibodies which target the heregulin . binding domain) to, for instance, 
5 stimulate or activate HER3 receptor. Such methods will be useful in treating 
pathological conditions characterized by or associated with insufficient HER3 expression 
or activity. The HER3 agonists typically comprise agonistic anti-HER3 antibodies or 
HER3 variant polypeptides. The agonistic activity of such HER3 agonists may comprise 
enhancing the activity of a native ligand for HER3 or activity which is the same as or 

10 substantially the same as (i.e., mimics) the activity of a native ligand for HER3. Typical 
methods of the invention include methods to treat pathological conditions or diseases in 
mammals associated with or resulting from increased or enhanced HER3 or heregulin 
expression and/or activity. In the methods of treatment, HER3 variant polypeptides 
may be administered which preferably block or reduce the respective receptor binding or 

15 activation by heregulin. 

The invention also provides compositions which comprise one or more HER3 or 
heregulin agonists or antagonists. Optionally, the compositions of the invention will 
include pharmaceuticaliy acceptable carriers or diluents. Preferably, the compositions 
will include one or more HER3 or heregulin antagonists or agonists in an amount which 

20 is therapeutically effective to treat a pathological condition or disease. Optionally, the 
compositions will . include one or more HER3 agonists in an amount which is 
therapeutically effective to stimulate signal transduction by HER3. 

The invention also provides articles of manufacture and kits which include one 
or more HER3 or heregulin antagonists or agonists. 

25 
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BRIEF DESCRIPTION OF THE DRAWINGS 
The file of this patent contains at least one drawing executed in color. Copies of 
this patent with color drawing(s) will be provided by the patent and Trademark Office • 
upon request and payment of the necessary fee. 
5 Fig. 1. A, limited proteolysis of HER3-ECD [ ' IV generates two detectable 

fragments, which remain associated under oxidizing conditions. HER3-ECD IIV was 
digested in the presence of trypsin (T) and analyzed in the Phast gel in the presence of 
the reducing agent (S-ME. The proteolytic digest of HER3-ECD IIV was analyzed on an 
SDS-PAGE Phast gel (10-15%) in the absence (lane 4) and the presence (lane 3) of the 

10 reducing agent p-ME. Nonproteolyzed HER3-ECD IIV in the presence of (3-ME is 
shown in lane 2. The two fragments in proteolyzed HER3-ECD I " rv remain linked by one 
or more disulfide bridges. B, a V5 antibody protects HER3-ECD I ' rv from cleavage at the 
C terminus. When HER3-ECD IEV is proteolyzed in the absence of the V5 antibody, the 
C terminus is digested and could not be visualized when probed for the V5 epitope. This 

15 gel shows HER3-ECD I " TV proteolyzed for the time indicated in the presence of an 
antibody against the V5 epitope tag, analyzed on a 4-15% polyacrylamide gradient gel, 
and transferred to a polyvinylidene difluoride membrane. The C-terminal fragment was 
identified by Western blotting using an antibody against the C-terminal V5 epitope. The 
V5 antibody protects the C terminus against proteolytic cleavage and shows that 

20 fragment 2 is the C-terminal fragment 

Fig. 2. A, nonproteolyzed HER3-ECD 1IV binds hrg, but proteolyzed HER3- 
ECD IIV does not The presence (+) or absence (-) of tryspin in the proteolytic digest is 
indicated above the lanes. HER3-ECD I_rv was analyzed on a native Phast gel (10-15%) 
in the absence (lane 1) and presence (lane 3) of equimolar hrg. Proteolyzed HER3-ECD 1 " 

25 ^ was also analyzed in the absence (lane 2) and presence (lane 4) of hrg. Formation of a 
hrg-HER3-ECD I IV complex is observed with the nonproteolyzed HER3-ECD i rv but 
not with the proteolyzed HER3-ECD IIV . B, nonproteolyzed HER3~ECD IIV is eluted by 
hrg in a pull-down assay, but proteolyzed HER3-ECD is not HER3-ECD IIV was 
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applied untreated (-) or digested in the absence (T) or presence of hrg (T + h) to agarose- 
bound hrg. HER3-ECD IIV was detected with an antibody with an epitope in fragment 1 . 
HER3-ECD I_IV was bound to immobilized hrg and could be eluted with free hrg (lane 3) 
and detergent (lane 6), but proteolyzed HER3-ECD I ' IV could not be eluted with hrg 
5 (lane 4) or detergent (lanes 7). HER3-ECD MV that was protected by hrg during 
proteolytic digestion could be recovered by both hrg (lane 5) and detergent (lane 8). 
Trypsin that was treated with phenylmediylsulfonyl fluoride prior to addition to the 
digestion reaction (*) did abolish hrg binding by HER3-ECD I ' rv , which could then be 
recovered by hrg (lane 9). This demonstrates that proteolyzed HER3-ECD irv does not 
10 bind hrg. 

Fig. 3. A, limited proteolysis of HER3-ECD I " IV in the presence of hrg 
generates a different cleavage pattern than in the absence of hrg when analyzed on an 
SDS-PAGE Phast gel (10-15%) under reducing conditions. HER3-ECD IIV was 
proteolyzed in the absence of ligand (-) and in the presence of hrg, NAs-hrg, and EGF, 

15 as indicated above the lanes. Limited proteolysis of HER3-ECD I " TV generates two 
fragments: one at an apparent molecular mass of 24.2 kDa (fragment 1) and another at 
54.1 kDa (fragment 2). In the presence of NAs-hrg (lane 4) and EGF (lane 5), fragment 1 
appears to be identical to fragment 1 in the absence of hrg (lane 2), whereas in the 
presence of wild type hrg (lane 3), fragment 1 appears to have a slighdy higher molecular 

20 mass. The digest of HER3-ECD I_IV in the presence of a binding-competent hrg 
generates a different cleavage pattern than in its absence. Apparently, the cleavage that 
generates fragment 1 can be protected by hrg and is due to a specific interaction between, 
hrg and HER3-ECD I ~ IV . J3, the niinimum amount of hrg required to produce a change in 
the molecular weight of fragment 1 in the SDS gel was determined by varying the molar 

25 ratio of hrg to HER3-ECD I IV . The ratio of hrg to HER3-ECD IIV was varied from 2.0 
to 0 (lanes 2-8), as indicated above the lanes. A change in the molecular weight of 
fragment 1 was observed when the molar ratio of hrg to HER3~ECD irv was between 
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0.9:1 and 1:1. Complete protection of HER3-ECD I " IV by hrg was observed at an 
approximately 1 : 1 ratio of hrg to HEItt-ECD 1 - 1 ^ 

Fig. 4. Domain organization and proteolytic cleavage sites of HER3-ECD I_IV . 
The extracellular region of HER3 contains domains I, II, III, and IV. Sequence positions 
5 are indicated at the start of each domain. The beginning of HEItf-ECD 1 -^ at position 20 
refers to the start of mature HER3-ECD [ " IV after cleavage of the leader sequence. The 
various digests that were analyzed are shown to the left: the digest of HER3-ECD I_IV in 
the presence (+) and absence (-) of hrg and mass spectrometry of the oxidized (-(3ME) 
and reduced (+|3ME) forms of the digest. The position of the cleavage site as determined 

10 by N-terrninal sequencing is indicated at the start of each fragment. Fragment 1 is 
indicated by a gray bar, and fragment 2 is represented by a black bar. Numbers in 
brackets are mass estimates in kDa as determined by mass spectrometry. S-S indicates 
that the two fragments are held together by a disulfide bond. The exact position of the 
cleavage at the C terminus is unknown, and this is indicated by a dashed bar. The 

15 postulated mass of the missing fragments is indicated in brackets below the fragments 
with dashed lines. In summary, hrg protects HER3-ECD from cleavage at position 50 
but not at any other site. 

Fig. 5. HEIG-ECD 1 -™ binds hrg but not EGF or the binding-deficient NA 5 - 
hrg. HER3-ECD MV was analyzed on a native Phast gel (10-15%) in the absence of hrg 

20 (lane 1) and in the presence of hrg (lane 2), NAs-hrg (lane 3), and EGF (lane 4). The 
ligands were used at a 1:1 ratio of hrg to HER3-ECD I ' IV . A complete shift is observed in 
the presence of hrg, and no shift is apparent in NAs-hrg or EGF. 

Fig. 6. Characterization of recombinant HER3-ECD WI by SDS-PAGE 
analysis, mass spectrometry, light scattering, and equilibrium centrifugation. A. y 

25 recombinant HER3-ECD 111 is pure as judged by SDS-PAGE analysis. HER3-ECD 1 - 11 
was expressed in Drosophila S2 cells and purified from the medium. The molecular mass 
appears to be somewhat less than 40 kDa. B t the molecular mass of HER3-ECD 111 is 
between 38.5 and 40 kDa as determined by MALDI mass spectrometry, compared with a 
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theoretical molecular mass of 37.7 kDa. This provide evidences that HER3-ECD 111 has a 
carbohydrate content between 3.3 and 5.7% by weight. C, recombinant HER3-ECD MI is 
a monomer as determined by size exclusion chromatography followed by multiangle light 
scattering. Recombinant HER3-ECD 1 - 11 eluted in a single peak (black line) with a 
5 molecular mass of -34 kDa (S.E. = ± 4%) (black circles). The addition of a 1.3-fold 
molar excess of hrg caused a slight shift in the peak (gray line). The shifted peak has a 
molecular mass of 35 kDa (S.E. = ±3%) (gray squares). Recombinant HER3-ECD 111 is a 
monomer and remains a monomer in the presence of hrg. D, recombinant HER3-ECD 1 " 
11 is a monomer, as determined by ultracentrifugation at three different concentrations 

10 and three different speeds. The average molecular mass of HER3-ECD 111 is 35.0 kDa 
(S.E. = ±0.6%), which corresponds to monomeric HER3-ECD 1 - 11 . 

Fig. 7. Recombinant HER3-ECD 1 * 11 binds hrg. A, HER3-ECD 1 "" was analyzed 
in a pull-down assay in which S-tagged hrg was immobilized on S-protein resin (lanes 3-. 
5) and could be specifically eluted with 1 uM hrg (lane 4) but not PBS (lane 3) or 1 uM 

15 EGF (lane 5). The fragment showed litde nonspecific binding to S-protein resin without 
immobilized S-tagged hrg (lanes 1 and 2) when eluted with PBS (lane 1) or 1 uM hrg 
(lane 2). This shows that domains I and II of HER3-ECD I_IV are involved in hrg binding. 
B, recombinant HER3-ECD 1 " 11 has a Kd of 68 nM as determined by SPR (BIAcore). 
HER3-ECD 1 " 11 showed binding to immobilized trx-hrg with a calculated equilibrium 

20 dissociation constant of 68 nM using the three different concentrations of HER3-ECD 1 " 
11 indicated This shows that HER3-ECD 111 binds hrg. 

Fig. 8. Sequence alignment of domain I of HER3-ECD IIV , IR, and IGF-1R 
indicates the position of the proteolysis site (T) in domain. I of HEItf-ECD 1 -^ relative 
to mutations (*) that decrease ligand binding in the IR. A naturally occurring mutation 

25 that decreases binding in IR is indicated by an open circle. The "hormone binding 
footprint 1 ' in the IGF-1R structure (43) is indicated by shaded areas in the sequence. This 
alignment shows that the proteolysis site HER3-ECD IIV that is protected by hrg 
corresponds to the analogous putative ligand-binding region in domain LI in IGF-1R. 
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Fig. 9. Structural similarity of IGF-1R to HEIG-ECD 1 "™ A t space-filling 
model of domains I and II of the structure of IGF-1R (43) showing the positions of the 
proteolysis sites (red) observed for the HER3-ECD I - IV mapped to the IGF-1R structure. 
The position of mutations (dark blue) that decrease ligand binding in IR are also mapped 
5 to a binding footprint in the IGF-1R structure. Domain LI is light blue, and L2 is 
lavender in IGF-1R (domains I and III in HERJ-ECD 1 ' 17 ), and domain SI (domain II in 
HER3-ECD I4V ) is green. 1740 A 2 of accessible surface area in domain LI are buried by 
domain SI in IGF-1R. The cleavage site at position 50 in HER3-ECD I - IV that is 
protected by hrg (position 18 in IGF-1R) lies on the same face of domain LI in IGF-1R 

10 and is 11 A away from residue 10 in the binding footprint of IR/IGF-1R. The 
unprotected cleavage site at position 270 in HER3-ECDI-IV (position 237 in IGF-1R) 
lies in domain II, is on the opposite side as the binding footprint, and is 40 A away from 
the binding footprint. Residue numbers are shown in the binding footprint The cleavage 
site that is protected by hrg in HER3-ECD IIV is near the binding footprint in domain LI 

15 of IGF-1R. B, backbone representation of domain II of IGF-1R showing the position of 
the proteolysis site at position 270 in HER3-ECD IIV (red cpk) (position 237 in IGF-1R) 
superimposed on the IGF-1R structure. The chain remains connected by a disulfide (red 
ball and stick) when cleavage occurs at position 270. The disulfide bridge connecting 
fragments I and II of HER3-ECD IIV is consistent with a structural model based on 

20 IR/IGF-1R. 

Fig. 10. A, HER3 WI and HER3 1-11 E64A were analyzed in a "pull down assay" in 
which S-tagged hrg was immobilized on S-protein resin (lanes 3-6) and HER3 l ~ n could 
be specifically eluted with 1 JiM hrg and detergent (lane 3 and 5), but HER3 1-11 E 64 * was 
not eluted by either 1 [iM hrg or detergent (lane 4 and 6). Both HER3 WI and HER3 *- 
25 HE64A showed little non-specific binding to S-protein resin without immobilized S-tagged 
hrg (lanes 1-2). This shows that HER3 WI does not bind hrg. B, Recombinant 
HER3 1-11 E64A had no detectable binding greater than 500 nM to the immobilized hrg on 
the BIAcore chip while HER3 I n showed binding to immobilized trx-hrg with a 
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calculated equihbrium dissociation constant of 68 nJVL This shows that HER3 UIE64 A 
does not bind hrg. 

Fig. 11. Characterization of recombinant HER3 i-uema by mass spectrometry 
and circular dichroism. A, the molecular mass of HER3 WI E64A is between 38.4 and 40.6 
5 kDa as. determined by MALDI mass spectrometry, which is comparable to the molecular 
mass of HER3 Ul (38.5 and 40 kDa). This provides evidence that the polypeptide chain 
of HER3 MiEMA is intact B, the CD spectra of HER3 l ~ u and HER3 111 were essentially 
identical between 210 nM and 260 nM provide evidencing that they have the same 
secondary structure. 

10 Fig. 12. A, HER3 "V binds hrg, but HER3 "V ema does not. HER3 IIV and the 

HER3 hW mutants were analyzed on a native Phast gel (10-15%) in the absence (-) and 
presence (+) of equimolar hrg. The HER3 l ' w EMA was also analyzed in the absence (lane 
2) and presence (lane 4) of hrg. Formation of a hrg-HER3 Z " IV complex is observed with 
HER3 I rv , but not with HER3 *- n ^\ B, HER3 w and HER3 E64A were analyzed in 

15 a "pull down assay" in which S-tagged hrg was immobilized on S-protein resin (lanes 4-9) 
and HER3 wv could be specifically eluted with 1 flM hrg and detergent (lane 6 and 8), 
but HER3 l - w ™> Ah could not be eluted with PBS or 1 uM hrg (lanes .5 and 7). A small 
amount of HER3 HVEMA could be eluted with detergent (lane 9) which was probably due 
to some binding or non-specific interaction. Both HER3 1JIV and HER3 ^ E64A showed 

20 little non-specific binding to S-protein resin without immobilized S-tagged hrg when 
eluted with 1 |xM hrg (lane 2-3). This shows that HER3 l ' w E64A does not bind hrg. 

Fig. 13. Space filling model of domains I, EE, and III of the structure of IGF-1R 
showing the positions of the protected proteolysis site (red) and residues E64 (yellow) 
and VI 10 (green) in the HER3 r " IV mapped to the IGF-1R structure (see, e.g. Garrett et 

25 aL (1998) Nature 394(6691), 395-9). The hormone binding footprint is in purple. E64 
and VI 10 of HER3 I TV are in or near the binding footprint in domain LI of IGF-1R. 

Fig. 14. Residues 64 and 110 could form specific interactions with heregulin. A, 
Backbone representation of domain I of IGF-1R showing the position of E64 (yellow 
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cpk) and VI 10 (green cpk) (H30 and L81 in IGF-1R) the residues critical for heregulin 
binding and the protected proteolysis site at position 50 in HER3 1 " rv (red cpk) (18 in 
IGF-1R) superimposed on the IGF-1R structure. E64 (H30 in IGF-1R) and V110 (L81 
in IGF-1R) which are critical for heregulin binding are 12 A from each other and E64 
5 (H30 in IGF-1R) is 13 A from residue 50 (position 18 in IGF-1R) the protected cleavage 
site. B, Backbone representation of the EGF-like domain of heregulin showing the 
position of HI 78 (yellow cpk) and VI 80 (green cpk). HER3 VI 10 and hrg VI 80 could 
form hydrophobic interactions. HER3 E64 and hrg HI 78 could form a salt bridge. 
These specific interactions could be disrupted by the mutations of E64A and VI 1 OA in 

10 HER3 as we observed in our biochemical experiments. 

Fig, 15. The N-terminal residues HI 78 (yellow cpk) and VI 80 (green cpk) of hrg 
can form interactions with the residues E64 (yellow cpk) and VI 10 (green cpk) of HER3 
and that the omega loop of hrg could block the protected proteolysis site (residue 50 red 
cpk) by sterically hindering interaction with trypsin. 

15 Fig. 16. HER3 *L (residues 204342) binds heregulin, but HER3 ^ E ^ A (residues 

20-1342 with a mutation of E64A) and HER3 u <^> (Domain I in figure, residues 184- 
1342) do not A, Specific binding of I 125 hrg to COS 7 cells transiently expressing 
HER3 mutants. The FL HER3 shows a high level of specific binding while the HER3 FL 
E64A anc j HER3 II - CTD show similar binding to cells expressing the vector only. B, 

20 Western blot analysis of transient expression of the HER3 mutants in COS7 cells. The 
WT (HER3 ^ L ) s E64A, and Domain I all express similar levels of protein. 

DETAILED DESCRIPTION OF THE INVENTION 

25 Unless otherwise defined, all terms of art, notations and other scientific terms or 

terminology used herein are intended to have the meanings commonly understood by 
those of skill in the art to which this invention pertains. In some cases, terms with 
commonly understood meanings are defined herein for clarity and/or for ready 
reference, and the inclusion of such definitions herein should not necessarily be 
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construed to represent a substantial difference over what is generally understood in the 
art. Many of the techniques and procedures described or referenced herein are well 
understood and commonly employed using conventional methodology by those skilled 
in the art, such as, for example, the widely utilized molecular cloning methodologies 
5 described in see Ausubel et aL, Current Protocols in Molecular Biology, Wiley 
Interscience Publishers, (1995) and Sambrook et aL, Molecular Cloning: A Laboratory 
Manual 2nd. edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
N.Y. As appropriate, procedures involving the use of commercially available kits and 
reagents are generally carried out in accordance with manufacturer defined protocols 

10 and/or parameters unless otherwise noted. 

Briefly, the disclosure provided herein identifies domains I-II of HER3 as the 
major binding region for heregulin and identifies residue positions in HER3 that, when 
mutated, produce heregulin binding polypeptides which have an altered affinity for this 
ligand. Prior to this discovery, the region of ligand binding in HER3 was unknown. 

15 This domain localization is unexpected in view of observations of EGFR (in which the 
binding of its ligand occurs in domain III), a closely related receptor that has a sequence 
identity HER3 and which belongs in the same family of type 1 kinase receptors. See, e.g. 
Singer et aL, JBC (2001), 276(47): 44266-44274. Various aspects and embodiments of 
the invention disclosed herein are provided in the description below. 

20 

I. Abbreviations 

Abbreviations used include: HER, human . epidermal growth factor receptor; 
ECD, extracellular domain; EGF, epidermal growth factor; EGFR, epidermal growth 
factor receptor; hrg, 60-residue EGF-like domain of human heregulin 1; IGF-1R, insulin 
25 growth factor-1 receptor; IR, insulin receptor; NAs-hrg, binding-deficient hrg mutant; 
PBS, phosphate-buffeted saline; RTK, receptor tyrosine kinase; trx-hrg, thioredoxin- 
heregulin fusion; 0-ME, p-mercaptoethanol; PAGE, polyacrylamide gel electrophoresis; 
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BSA, bovine serum albumin; MALDI, matrix-assisted laser desorption/ionization; SPR, 
surface plasmon resonance; MES, 4-morpholineedianesulfonic acid. 

II. Definitions 

5 The terms "HER3", "HER3 polypeptide" or "HER3 receptor" when used herein 

encompass "native sequence HER3 polypeptides" and "HER3 variants" (which are 
further defined herein). "HER3" is a designation given to. those polypeptides which are 
encoded by the nucleic acid molecules comprising the polynucleotide sequences shown 
in SEQ ID NO: 1 and variants or fragments thereof, nucleic acid molecules comprising 

10 the sequence shown in the SEQ ID NO: 1. The HER3 polynucleotides and 
polypeptides of the invention may be isolated from a variety of sources, such as from 
human tissue types or from another source, or prepared by recombinant and/or 
synthetic methods. See, e.g. Plowman et al., PNAS (1990) 87: 4905-4909. As used 
herein, the term "polypeptide" means an amino acid polymer of at least 6 amino acids. 

15 A "native sequence" HER3 polypeptide comprises a polypeptide having the same 

amino acid sequence as the corresponding HER3 polypeptide derived from nature. Such 
native sequence HER3 polypeptides can be isolated from nature or can be produced by 
recombinant and/or synthetic means. The term "native sequence HER3 polypeptide" 
specifically encompasses naturaHy-occurring truncated or secreted forms (e.g., an 

20 extracellular domain sequence), naturally-occurring variant forms (e.g., alternatively 
spliced forms) and natmaUy-occurring allelic variants of the polypeptide. The HER3 
polypeptides of the invention include the HER3 polypeptide comprising or consisting of 
the contiguous sequence of amino acid residues 1 to 1342 of SEQ ID NO: 2. 

A HER3 "extracellular domain" or "ECD" refers to a form of the HER3 

25 polypeptide which is essentially free of the transmembrane and cytoplasmic domains. 
Ordinarily, a HER3 polypeptide ECD will have less than about 1% of such 
transmembrane and/or cytoplasmic domains and preferably, will have less than about 
0.5% of such domains. It will be understood that any transmembrane domain(s) 
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identified for the HER3 polypeptides of the present invention are identified pursuant to 
criteria routinely employed in the art for identifying that type of hydrophobic domain. 
The exact boundaries of a transmembrane domain may vary but most likely by no more 
than about 5 amino acids at either end of the domain as initially identified. 
5 "HER3 variant" means a HER3 polypeptide having at least about 80% amino 

acid sequence identity with the amino acid sequence of a native sequence full length 
HER3 or HER3 ECD. Such HER3 variant polypeptides include, for instance, HER3 
polypeptides wherein one or more amino acid residues are added, or deleted, at the N- 
and/or C-terminus, as well as within one or more internal domains, of the full-length 

10 amino acid sequence. Fragments of the HER3 ECD ate also contemplated. Ordinarily, 
a HER3 variant polypeptide will have at least about 80% amino acid sequence identity, 
more preferably at least about 81% amino acid sequence identity, more preferably at least 
about 82% amino acid sequence identity, more preferably at least about 83% amino acid 
sequence identity, more preferably at least about 84% amino acid sequence identity, more 

15 preferably at least about 85% amino acid sequence identity, more preferably at least 
about 86% amino acid sequence identity, more preferably at least about 87% amino acid 
sequence identity, more preferably at least about 88% amino acid sequence identity, more 
preferably at least about 89% amino acid sequence identity, more preferably at least 
about 90% amino acid sequence identity, more preferably at least about 91% amino acid 

20 sequence identity, more preferably at least about 92% amino acid sequence identity, more 
preferably at least about 93% amino acid sequence identity, more preferably at least 
about 94% amino acid sequence identity, more preferably at least about 95% amino acid 
sequence identity, more preferably at least about 96% amino acid sequence identity, more 
preferably at least about 97% amino acid sequence identity, more preferably at least 

25 about 98% amino acid sequence identity and yet more preferably at least about 99% 
amino acid sequence identity with a HER3 polypeptide encoded by a nucleic acid 
molecule shown in Figure 10 or a specified fragment thereof. HER3 variant 
polypeptides do not encompass the native HER3 polypeptide sequence. Ordinarily, 
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HER3 variant polypeptides are at least about 10 amino acids in length, often at least 
about 20 amino acids in length, more often at least about 30 amino acids in length, more 
often at least about 40 amino acids in length, more often at least about 50 amino acids in 
length, more often at least about 60 amino acids in length, more often at least about 70 
5 amino acids in lengthy more often at least about 80 amino acids in length, more often at 
least about 90 amino acids in length, more often at least about 100 amino acids in length, 
more often at least about 150 amino acids in length, more often at least about 200 amino 
acids in length, more often at least about 250 amino acids in length, more often at least 
about 300 amino acids in length, or more. 
10 The terms "heregulin" or "heregulin polypeptide" when used herein encompass 

"native sequence heregulin polypeptides ,, and "heregulin variants". "Heregulin" is a 

designation given to those polypeptides which are encoded by the nucleic acid molecules 

t 

comprising the polynucleotide sequences shown in SEQ ID NO: 3 and variants thereof, 
nucleic acid molecules comprising the sequence shown in the SEQ ID NO: 3, and 

15 variants thereof as well as fragments of the above which have the biological activity of 
die native sequence heregulin. Variants of heregulin will preferably have at least 80%, 
more preferably, at least 90%, and even more preferably, at least 95% amino acid 
sequence identity with the native sequence heregulin polypeptide shown in SEQ ID NO: 
4. A "native sequence" heregulin polypeptide comprises a polypeptide having the same 

20 amino acid sequence as the corresponding heregulin polypeptide derived from nature. 
Such native sequence heregulin polypeptides can be isolated from nature or can be 
produced by recombinant and/or synthetic means. 

The terms "HER2" or "HER2 polypeptide" when used herein encompass 
"native sequence HER2 polypeptides" and "HER2 variants". "HER2" is a designation 

25 given to those polypeptides which are encoded by the nucleic acid molecules comprising 
the polynucleotide sequences shown in SEQ ID NO: 5 and variants thereof, nucleic acid 
molecules comprising the sequence shown in the Figure SEQ ED NO: 5, and variants 
thereof as well as fragments of the above which have the biological activity of the native 
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sequence HER2. Variants of HER2 will preferably have at least 80%, more preferably, 
at least 90%, and even more preferably, at least 95% amino acid sequence identity with 
the native sequence HER2 polypeptide shown in SEQ ID NO: 6. A "native sequence" 
HER2 polypeptide comprises a polypeptide having the same amino acid sequence as the 
5 corresponding HER2 polypeptide derived from nature. Such native sequence HER2 
polypeptides can be isolated from nature or can be produced by recombinant and/or 
synthetic means. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary 
skill in the art, and generally is an empirical calculation dependent upon probe length, 

10 washing temperature, and salt concentration. In general, longer probes require higher 
temperatures for proper annealing, while shorter probes need lower temperatures. 
Hybridization generally depends on the ability of denatured DNA to re-anneal when 
complementary strands are present in an environment below their melting temperature. 
The higher the degree of desired identity between the probe and hybridizable sequence, 

15 the higher the relative temperature which can be used. As a result, it follows that higher 
relative temperatures would tend to make the reaction conditions more stringent, while 
lower temperatures less so. For additional details and explanation of stringency of 
hybridization reactions, see Ausubel et al, Current Protocols in Molecular Biology , Wiley 
Interscience Publishers, (1995). 

20 "Stringent conditions" or "high stringency conditions", as defined herein, are 

identified by those that: (1) employ low ionic strength and high temperature for washing, 
0.015 M sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; 
(2) employ during hybridization a denaturing agent, 50% (v/v) formamide with 0.1% 
bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrroHdone/50rnM sodium 

25 phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 
42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 
mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, 
sonicated salmon sperm DNA (50 ug/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, 
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with washes at 42°C in 0.2 x SSC (sodium chloride/sodium citrate) and 50% formamide 
at 55°C, followed by a high-stringency wash consisting of 0.1 x SSC containing EDTA at 
55°C 

"Moderately stringent conditions" are identified as described by Sambrook et aL, 
5 Molecular Cloning: A Laboratory Manual New York: Cold Spring Harbor Press, 1989, 
and include the use of washing solution and hybridization conditions (e.g., temperature, 
ionic strength and %SDS) less stringent that those described above. An example of 
moderately stringent conditions is overnight incubation at 37°C in a solution comprising: 
20% formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium 

10 phosphate (pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/ml 
denatured sheared salmon sperm DNA, followed by washing the filters in 1 x SSC at 
about 37-50°C. The skilled artisan will recognize how to adjust the temperature, ionic 
strength, etc. as necessary to accommodate factors such as probe length and the like. 

Nucleic acid is "operably linked" when it is piaced into a functional relationship 

15 with another nucleic acid sequence. For example, DNA for a presequence or secretory 
leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein that 
participates in the secretion of the polypeptide; a promoter or enhancer is operably 
linked to a coding sequence if it affects the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is positioned so as to facilitate 

20 translation. Generally, "operably linked" means that the DNA sequences being linked 
are contiguous, and, in the case of a secretory leader, contiguous and in reading phase. 
However, enhancers do not have to be contiguous. Linking is accomplished by ligation 
at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide 
. adaptors or linkers are used in accordance with conventional practice. 

25 The terms "amino acid" and "amino acids" refer to all naturally occurring L- 

alpha-amino acids. This definition is meant to include norleucine, ornithine, and 
homocysteine. The amino acids are identified by either the single-letter or three-letter 
designations: 
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Asp D aspartic acid He I isoleucine 

Thr T threonine Leu L leucine 

Ser S serine Tyr Y tyrosine 

Glu E glutamic acid Phe F phenylalanine 

5 Pro P proline His H histidine 

Gly G glycine Lys K lysine 

Ala A alanine Arg R atginine 

Cys C cysteine Trp W tryptophan 

Val V valine Gin Q glutarnine 

10 Met M methionine Asn N asparagine 

In the Sequence Listing and Figures, certain other single-letter or three-letter 
designations may be employed to refer to and identify two or more amino acids or 
nucleotides at a given position in the sequence. 

"Percent (%) amino acid sequence identity" with respect to the ligand or receptor 

15 polypeptide sequences identified herein is defined as the percentage of amino acid 
residues in a candidate sequence that are identical with the amino acid residues in such a 
ligand or receptor sequence identified herein, after aligning the sequences and 
introducing gaps, if necessary, to achieve the maximum percent sequence identity, and 
not considering any conservative substitutions as part of the sequence identity. 

20 Alignment for purposes of determining percent amino acid sequence identity can be 
achieved in various ways that are within the skill in the art, for instance, using publicly 
available computer software such as BLAST, BLAST-2, ALIGN, ALIGN-2 or Megalign 
(DNASTAR) software. Those skilled in the art can determine appropriate parameters 
for measuring alig nm ent, inclu ding any algorithms needed to achieve maximal alignment 

25 over the full-length of the sequences being compared 

The term "antagonist" is used in the broadest sense, and includes any molecule 
that partially or fully blocks, inhibits, or neutralizes one or more biological activities of 
heregulin polypeptide, HER3 polypeptide, or both heregulin and HER3, in vitrv, in situ, or 
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in vivo. Examples of such biological activities of heregulin and HER3 polypeptides 
include binding of heregulin to HER3 and activation of the mitogenic response that is 
observed when heregulin interacts with HER3, as reported in the literature (see, e.g. Lee 
et al., (2001) Cancer Research 61, 4467-4473; Aguilar et al., (1999) Oncogene 18, 6050- 
5 6052; Marte et al., Oncogene 1995 Jan 5;10(l):167-75; and Neve et al., Oncogene 2000 
Mar 23;19(13):1647-56). An antagonist may function in a direct or indirect manner. For 
instance, the antagonist may function to partially or fully block, inhibit or neutralize one 
or more biological activities of heregulin polypeptide, HER3 polypeptide, or both 
heregulin and HER3, /// vitro, in situ, or in vivo as a result of their direct interaction. The 

10 antagonist may also function indirectly to partially or fully block, inhibit or neutralize one 
or more biological activities of heregulin polypeptide, HER3 polypeptide, or both 
heregulin and HER3, /;/ vitro, in situ, or /// vivo as a result of, e.g., blocking or inhibiting 
another effector molecule. The antagonist molecule may comprise a "dual" antagonist 
activity wherein the molecule is capable of partially or fully blocking, inhibiting or 

15 neutralizing a biological activity of both heregulin and HER3. 

The term "agonist" is used in the broadest sense, and includes any molecule that 
partially or fully enhances, stimulates or activates one or more biological activities of 
heregulin polypeptide, HER3 polypeptide, or both heregulin and HER3, in vitro, in situ, or 
in vivo. Examples of such biological activities of heregulin and HER3 may include 

20 activation of downstream signal transduction (see, e.g. Aguilar et al., (1999) Oncogene 
18, 6050-6052; Marte et aL, Oncogene 1995 Jan 5;1 0(1): 167-75; and Neve et aL, 
Oncogene 2000 Mar 23;19(13):1647-56). An agonist may function in a direct or indirect 
manner. For instance, the agonist may function to partially or fully enhance, stimulate or 
activate one or more biological activities of heregulin polypeptide, HER3 polypeptide, or 

25 both heregulin and HER3, in vitro, in situ, or in vivo as a result of its direct binding to 
heregulin or HER3, which causes receptor activation or signal transduction. The agonist 
may also function indirecdy to partially or fully enhance, stimulate or activate one or 
more biological activities of heregulin polypeptide, HER3 polypeptide, or both heregulin 
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and HER3, in vitiv y in situ, or in vivo as a result of, e.g., stimulating another effector 
molecule which then causes heregulin or HER3 receptor activation or signal 
transduction. It is contemplated that an agonist may act as an enhancer molecule which 
functions indirectly to enhance or increase heregulin or HER3 activation or activity. For 
5 instance, the agonist may enhance activity of endogenous heregulin or HER3 in a 
mammal. This could be accomplished, for example, by pre-complexing heregulin or 
HER3 or by stabilizing complexes of the respective ligand with the heregulin or HER3 
receptor (such as stabilizing native complex formed between heregulin and HER3). 

The term "heregulin antagonist" or "HER3 antagonist" refers to any molecule that 

10 partially or fully blocks, inhibits, or neutralizes a biological activity of heregulin or HER3, 
respectively, or both heregulin and HER3, and include, but are not limited to, soluble 
forms of HER3 receptor or such as an extracellular domain sequence of HER3, HER3 
receptor fusion proteins, covalentiy modified forms of HER3 receptor, covalently modified 
forms of HER3 receptor, HER3 variants, heregulin variants, HER3 receptor antibodies 

15 and heregulin antibodies. To determine whether a heregulin antagonist molecule partially 
or fully blocks, inhibits or neutralizes a biological activity of heregulin or HER3, assays may 
be conducted to assess the effect(s) of the antagonist molecule on, for example, binding of 
heregulin to HER3. Such assays may be conducted in known in vitro or in vivo assay 
formats, for instance, in cells expressing HER3. Preferably, the heregulin antagonist 

20 employed in the methods described herein will be capable of blocking or neutralizing at 

least one type of heregulin activity, which may optionally be determined in assays such as 

described herein. To determine whether an HER3 antagonist molecule partially or fully 

blocks, inhibits or neutralizes a biological activity of heregulin or HER3, assays may be 

i 

conducted to assess the effects) of the antagonist molecule on, for example, binding of 
25 heregulin to HER3. Such assays may be conducted in known in vitro or in vivo formats, for 
instance, using cells transfected with HER3. Preferably, the HER3 antagonist employed in 
the methods described herein will be capable of blocking or neutralizing at least one type of 
HER3 activity. Optionally, a heregulin antagonist or HER3 antagonist will be capable of 
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reducing or inhibiting binding of heregulin to HER3 by at least 50%, preferably, by at least 
90%, more preferably by at least 99%, and most preferably, by 100%, as compared to a 
negative control molecule, in a binding assay. In one embodiment, the heregulin antagonist 
or HER3 antagonist will comprise antibodies which will competitively inhibit the binding 
5 of heregulin to HER3. Methods for determining antibody specificity and affinity by 
. competitive inhibition are known in the art (see, e.g., Harlow et aL, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY 
(1998); Colligan et aL, Current Protocols in Immunology , Green Publishing Assoc., NY 
(1992; 1993); Muller, Meth. Enzym. . 92:589-601 (1983). 

10 The term "heregulin agonist" or "HER3 agonist" refers to any molecule that 

partially or fully enhances, stimulates or activates a biological activity of heregulin or HER3, 
respectively, or both heregulin and HER3, and include, but are not limited to HER3 variant 
polypeptides and heregulin binding domain specific anti-HER3 receptor antibodies. To 
determine whether a heregulin agonist molecule partially or fully enhances, stimulates, or 

15 activates a biological activity of HER3, assays may be conducted to assess the effect(s) of 
the agonist molecule on, for example, HER3-expressing or HER3-transfected cells. Such 
assays may be conducted in known in vitro or in vivo assay formats. Preferably, the heregulin 
agonist employed in the methods described herein will be capable of enhancing or 
activating at least one type of heregulin activity, which may optionally be determined in 

20 assays such as described herein. 

The term "antibody" is used in the broadest sense and specifically covers, for 
example, single monoclonal antibodies against heregulin and HER3 (e.g. those targeting 
the heregulin binding domain in HER3), antibody compositions with polyepitopic 
specificity, single chain antibodies, and fragments of antibodies. "Antibody 1 ' as used 

25 herein includes intact immunoglobulin or antibody molecules, polyclonal antibodies, 
multispecific antibodies (Le., bispecific antibodies formed from at least .two intact, 
antibodies) and immunoglobulin fragments (such as Fab, F(ab*)2, or Fv), so long as they 
exhibit any of the desired agonistic or antagonistic properties described herein. 
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Antibodies are typically proteins or polypeptides which exhibit binding specificity 
to a specific antigen. Native antibodies are usually heterotetrameric glycoproteins, 
composed of two identical light (L) chains and two identical heavy (H) chains. Typically, 
each light c h ain is linked to a heavy chain by one covalent disulfide bond, while the number 
5 of disulfide linkages varies between die heavy chains of different immunoglobulin isotypes. 
Each heavy and light chain also has regukriy spaced intrachain disulfide bridges. Each 
heavy chain has at one end a variable domain (Vh) followed by a number of constant 
domains. Each light chain has a variable domain at one end (Vl) and a constant domain at 
its other end; the constant domain of the light chain is aligned with the first constant 

10 domain of the heavy chain, and the light chain variable domain is aligned with the variable 
domain of the heavy chain. Particular amino acid residues are believed to form an interface 
between the light and heavy chain variable domains (Chothia et aL J. MoL BioL T 186:651- 
663 (1985); Novotny and Haber, Proc. Natl Acad Sci. USA T 82:4592-4596 (1985)). The 
light chains of antibodies from any vertebrate species can be assigned to one of two clearly 

15 distinct types, called kappa and lambda, based on the amino acid sequences of their 
constant domains. Depending on the amino acid sequence of the constant domain of their 
heavy chains, immunoglobulins can be assigned to different classes. There are five major 
classes of immunoglobulins: IgA, IgD, IgE, IgG and IgM, and several of these may be 
further divided into subclasses (isotypes), e.g., IgG-1, IgG-2, IgG-3, and IgG-4; IgA-1 and 

20 IgA-2. The heavy chain constant domains that correspond to the different classes of 
immunoglobulins are called alpha, delta, epsilon, gamma, and mu, respectively. 

"Antibody fragments" comprise a portion of an intact antibody, generally the 
antigen binding or variable region of the intact antibody. Examples of antibody fragments 
include Fab, Fab 1 , F(ab*)2, and Fv fragments, diabodies, single chain antibody molecules, 

25 and multispecific antibodies formed from antibody fragments. 

The term "variable" is used herein to describe certain portions of the variable 
domains which differ in sequence among antibodies and are used in the binding and 
specificity of each particular antibody for its particular antigen. However, the variability is 
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not usually evenly distributed through the variable domains of antibodies. It is typically 
concentrated in three segments called complementarity determining regions (CDRs) or 
hypervariable regions both in the light chain and the heavy chain variable domains. The 
more highly conserved portions of the variable domains are called the framework (FR). 
5 The variable domains of native heavy and light chains each comprise four FR regions, 
largely adopting a P-sheet configuration, connected by three CDRs, which form loops 
connecting, and in some cases forming part of, the p-sheet structure. The CDRs in each 
chain are held together in close proximity by the FR regions and, with the CDRs from the 
other chain, contribute to the formation of the antigen binding site of antibodies (see 

10 Kabat, E.A. et aL, Sequences of Proteins of Immunological Interest , National Institutes of 
Health, Bethesda, MD (1987)). The constant domains are not involved direcdy in binding 
an antibody to an antigen, but exhibit various effector functions, such as participation of 
the antibody in antibody-dependent cellular toxicity. 

The term "monoclonal antibody" as used herein refers to an antibody obtained 

15 from a population of substantially homogeneous antibodies, ie., the individual antibodies 
comprising the population are identical except for possible natoaUy-occurring mutations 
that may be present in minor amounts. Monoclonal antibodies are highly specific, being 
directed against a single antigenic site. Furthermore, in contrast to conventional 
(polyclonal) antibody preparations which typically include different antibodies directed 

20 against different determinants (epitopes), each monoclonal antibody is directed against a 
single determinant on the antigen. 

The monoclonal antibodies herein include chimeric, hybrid and recombinant 
antibodies produced by splicing a variable (including hypervariable) domain of the antibody 
of interest with a constant domain (e.g. "humanized" antibodies), or a light chain with a 

25 heavy chain, or a chain from one species with a chain from another species, or fusions with 
heterologous proteins, regardless of species of origin or immunoglobulin class or subclass 
designation, as well as antibody fragments {e.g., Fab, F(ab')2, and Fv), so long as they exhibit 
the desired biological activity or properties. See, e.g. U.S. Pat No. 4,816,567 and Mage et 
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aL, in Monoclonal Antibody Production Techniques and Applications, pp.79-97 (Marcel 
Dekker, Inc.: New York, 1987). 

The term "immunospecific" as used in "immunospecific binding of antibodies" 
for example, refers to the antigen specific binding interaction that occurs between the 
5 antigen-combining site of an antibody and the specific antigen recognized by that 
antibody. 

"Isolated," when used to describe the various proteins disclosed herein, means 
protein that has been identified and separated and/or recovered from a component of its 
natural environment Contaminant components of its natural environment are materials 

10 that would typically interfere with diagnostic or therapeutic uses for the protein, and may 
include enzymes, hormones, and other proteinaceous or non-proteinaceous solutes. In 
preferred embodiments, the protein will be purified (1) to a degree sufficient to obtain at 
least 15 residues of N-terrninal or internal amino acid sequence by use of a spinning cup 
sequenator, or (2) to homogeneity by SDS-PAGE under non-reducing or reducing 

15 conditions using Coomassie blue or, preferably, silver stain. Isolated protein includes 
protein in situ within recombinant cells, since at least one component of the protein natural 
environment will not be present - Ordinarily, however, isolated protein will be prepared by 
at least one purification step. 

"Treatment" or "therapy" refer to both therapeutic treatment and prophylactic or 

20 preventative measures. 

"Mammal" for purposes of treatment or therapy refers to any animal classified as . 
a mammal, including humans, domestic and farm animals, and zoo, sports, or pet 
animals, such as dogs, horses, cats, cows, etc. Preferably, the mammal is human. 

"HER3— related pathological condition" and "heregulin-related pathological 

25 condition" refer to pathologies or conditions associated with abnormal levels of 
expression or activity of HER3 or heregulin, respectively, in excess of, or less than, levels 
of expression or activity in normal healthy mammals, where such excess or diminished 
levels occur in a systemic, localized, or particular tissue or cell type or location in the 
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body. HER3 —related pathological conditions and heregulin-related pathological 
conditions include syndromes characterized by disregulated cell growth such as cancer. 

The terms "cancer", "cancerous", and "malignant" refer to or describe the 
physiological condition in mammals that is typically characterized by unregulated cell 
5 growth. Examples of cancer include but are not limited to, carcinoma including 
adenocarcinoma, lymphoma, blastoma, melanoma, sarcoma, and leukemia. More 
particular examples of such cancers include squamous cell cancer, small-cell lung cancer, 
non-small cell lung cancer, gastrointestinal cancer, Hodgkin's and non-Hodgkin's 
lymphoma, pancreatic cancer, glioblastoma, cervical cancer, ovarian cancer, liver cancer 

10 such as hepatic carcinoma and hepatoma, bladder cancer, breast cancer, colon cancer, 
colorectal cancer, endometrial carcinoma, myeloma (such as multiple myeloma), salivary 
gland carcinoma, kidney cancer such as renal cell carcinoma and Wilms' tumors, basal 
cell carcinoma, melanoma, prostate cancer, vulval cancer, thyroid cancer, testicular 
cancer, esophageal cancer, and various types of head and neck cancer. Optionally, the 

15 cancer will express, or have associated with the cancer cell, heregulin or HER3. By way 
of example, breast and ovarian cancers are among the cancers reported in the literature 
to express HER3. 

II. Methods and Materials 

20 The disclosure provided herein teaches , that the EGF-like domain of heregulin 

(hrg) binds to domains I and II of HER3, in contrast to the EGF receptor, for which 
prior studies have shown that a construct consisting of domains III and portions of 
domain IV binds EGF. As illustrated in Example 2, a hrg binding site is identified by 
limited proteolysis of the recombinant extracellular domains of HER3 (HERS-ECD 1 " 1 ^ 

25 in both the presence and absence of hrg. In the absence of hrg, HERS-ECD 1 -™ is 
cleaved after position Tyr 50 , near the beginning of domain I. Binding of hrg to HER3- 
ECD IIV fully protects position Tyr 50 from proteolysis. To confirm that domain I 
contains a hrg binding site, a HER3 polypeptide comprising only domains I and EL (and 
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not domains III and IV) of HER3 (HER3-ECD r - n ) was generated. This HER3-ECD 111 
polypeptide was then shown to bind hrg with 68 nM affinity. These data provide 
evidence that domains I and II of HER3-ECD I[V act as a functional unit in folding and 
binding of hrg. 

5 The biochemical findings reinforce the structural hypothesis that HER3-ECD HV 

is similar to die insulin-like growth factor-1 receptor (IGF-1R), as follows: 1) The 
protected cleavage site in HER3-ECD IIV corresponds to a binding footprint in domain I 
of IGF-1R; 2) HER3-ECD 1 - 11 binds hrg with a 68 nM dissociation constant, supporting 
the hypothesis that domain I is involved in ligand binding; and 3) the large accessible • 

10 surface area (1 749 A) of domain LI of IGF-1R that is buried by domain SI, as well as the 
presence of conserved contacts in this interface of type 1 RTKs, provide evidences that 
domains LI and SI of IGF-1R function as a unit as observed for HER3-ECD 111 . The 
results are consistent with a model wherein HER3 has a structure similar to IGF-1R and 
binds ligand at a site in corresponding domains. 

15 In this context, the invention provides methods and materials for identifying 

molecules which bind to the heregulin binding site on HER3 as well as methods and . 
materials for modulating heregulin and/or HER3 activity in mammalian cells. Typical 
methods for modulating heregulin and/ or HER3 activity in mammalian cells comprise 
exposing the cells to a desired amount of antagonist or agonist that affects heregulin 

20 interaction with HER3. Preferably, the amount of antagonist or agonist employed will 
be an amount effective to affect the binding and/or activity of the respective ligand or 
respective receptor to achieve a therapeutic effect This can be accomplished in vivo or ex 
vivo in accordance, for instance, with the methods described below and in the Examples. 
f An exemplary condition or disorders to be treated with such antagonists or antagonists is 

25 a cancer whose malignant phenotype is associated with aberrant HER3 or heregulin 
expression. 

Diagnostic methods are also provided herein. For instance, the antagonists or 
agonists may be employed to detect the respective binding partner (heregulin and HER3) 
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or in mammals known to be or suspected of having a heregulin— related pathological 
condition or HER3-related pathological condition. The antagonist or agonist molecule 
may be used, e.g., in immunoassays to detect or quantitate heregulin or HER3 in a 
sample. A sample, such as cells obtained from a mammal, can be incubated in the 
5 presence of a labeled antagonist or agonist molecule, and detection of the labeled 
antagonist or agonist bound in the sample can be performed. Such assays, including 
various clinical assay procedures, are known in the art 

The antagonists and agonists which can be employed in the methods include, but 
are not limited to, soluble forms of HER3 receptor, HER3 receptor immunoadhesins, 
10 fusion proteins comprising HER3, covalendy modified forms of heregulin and HER3, 
HER3 receptor variants anti-HER3 receptor antibodies that target the heregulin binding 
domain, and anti-heregulin antibodies that target the HER3 binding domain. Various 
techniques that can be employed for making the antagonists and agonists are described 
herein. 

15 

A. HER3 POLYPEPTIDES 

The disclosure provided herein includes HER3 polypeptides such as the HER3- 
ECD 1 " 11 polypeptide discussed above as well as HER3 substitution variants such as those 
having an amino acid substitution at residue E64 and/or VI 10. The description below 

20 provides typical schemes for the production of HER3 polypeptides including HER3 
polypeptide variants, by culturing cells transformed or transfected with a vector 
containing HER3 polypeptide encoding nucleic acid. It is, of course, contemplated that 
alternative methods, which are well known in the art, may be employed to prepare HER3 
polypeptides. For instance, the HER3 polypeptide sequence, or portions thereof, may be 

25 produced by direct peptide synthesis using solid-phase techniques (see, e.g., Stewart et aL, 
Solid-Phase Peptide Synthesis . W.H. Freeman Co., San Francisco, CA (1969); Merrifield, 
T. Am. Chem. Soc. T 85:2149-2154 (1963)). In vitro protein synthesis may be performed 
using manual techniques or by automation. Automated synthesis may be accomplished, 



31 



WO 03/01 1897 PCT/US02/23963 



for instance, using an Applied Biosystems Peptide Synthesizer (Foster City, CA) using 
manufacturer's instructions. Various portions of HER3 polypeptides may be chemically 
synthesized separately and combined using chemical or enzymatic methods to produce a 
full-length HER3 polypeptide. 

5 This disclosure identifies amino acids in discontinuous segments in domain I of 

HER3 which appear to be critical for heregulin binding (see, e.g. Table 3). For example, 
a mutation at residue 64 appears to decrease all binding affinity in both recombinant 
HER3 HVE64A and HER3 "i EG4A. w hile a mutation of V110A in and HER3 l ' u causes a 
4 fold increase in heregulin binding affinity. Residues 64 and 110 are in domain I of 

10 HER3 and map to residues H30 and L81 in the hormone binding footprint in the IGF- 
1R. Residues H30 and L81 lie on different strands in IGF.-1R, but are within 12A of 
each other. HER3 VI 10 and hrg VI 80 could form hydrophobic interactions, and HER3 
E64 and hrg HI 78 could form a salt bridge with heregulin. These specific interactions 
could be disrupted by the mutations of E64A and VI 1 OA in HER3 as we observed in 

15 our biochemical experiments. 

A preferred embodiment of the invention is a HER3 variant polypeptide 
comprising an amino acid sequence which differs from the native sequence HER3 
polypeptide sequence of SEQ ID NO: 2 and has one or more of amino acid 
substitutions at the following residue position(s) in SEQ ID NO: 2: E43; N44; K51; E64; 

20 V66; and VI 10. Typically, the variant binds heregulin with an affinity different from a 
HER2 polypeptide that has no amino acid substitution at the enumerated position(s). 
Another preferred embodiment of the invention is a HER3 variant polypeptide 
comprising an amino acid sequence which differs from the native sequence HER3 
polypeptide sequence of SEQ ID NO: 2 and has one or more of amino acid 

25 substitutions at the residue positions identified in Table 3. Another embodiment of the 
invention is a non-naturally occurring HER3 variant polypeptide consisting essentially of 
amino acid 20 to amino acid 329 of HER3 polypeptide sequence of SEQ ID NO: 2; 
wherein the HER3 variant polypeptide specifically binds to the heregulin polypeptide of 
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SEQ ID NO: 4 and exhibits an impaired ability to interact with the HER2 polypeptide of 
SEQ ID NO: 6. Related embodiments of this invention include an isolated nucleic acid 
comprising a nucleotide sequence encoding these HER3 variants, vectors comprising 
nucleic acids encoding these variants and host cells (e.g. E. coli) containing these vectors. 
5 Another related embodiment of the invention is a method of making HER3 variant 
polypeptide by providing a host cell comprising a vector encoding a HER3 variant, 
providing culture media, culturing the host cell in the culture media under conditions 
sufficient to express the HER3 variant polypeptide, recovering the HER3 variant 
polypeptide from the host cell or culture media, and then purifying the HER3 variant 
10 polypeptide. Optionally the HER3 variant polypeptide is conjugated or linked to one or 
more polyol groups. 

The HER3 variants disclosed herein have a number of uses, and can, for 
example, be used to elucidate the signalling mechanisms of the heregulin/HER2/HER3 
complex, a complex which, as noted above, is implicated in the malignant phenotype of 

15 certain cancers such as breast and ovarian cancers. Specifically, as noted herein, the 
molecules of the heregulin/HER2/HER3 complex contain different domains having 
different functional properties including an ability to associate in HER3 homo-oligomers 
as well as HER2/HER3 hetero-oligorners. The HER2/HER3 hetero-oligomers 
associated with heregulin produce a functional signalling complex that is associated with 

20 the biological activity of the molecules. Consequendy, the variants disclosed herein 
which are designed to target a HER3 domain that binds to heregulin and to, for example, 
bind with a greater or lesser affinity to heregulin, can be used to assess interaction 
between HER2 and HER3 as well as the interaction between heregulin and HER2 and 
HER3. As is known in the art, the elucidation of structural domains associated with 

25 biological activities and the signalling mechanisms of molecules that are associated with 
pathologies such as cancer is a crucial step in the identification of novel diagnostic and 
therapeutic methods and materials (e.g. heregulin agonists and antagonists) which can be 
used to treat such pathologies. 
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In this regard, the disclosure provides evidence that a specific subset of amino 
acid residues in the HER3 polypeptide are likely to be involved in the interaction with 
heregulin (and consequently can be mutated in studies designed to characterize the Her2, 
HER3 and heregulin interactions). The data providing this evidence includes a number 
5 of independent and complementary observations mducling: (1) sequence comparison 
with residues in the structurally homologous IGF-1R molecule that are known to be 
associated with ligand binding (see, e.g. Figure 8); (2) crystallographic studies to map a 
heregulin binding footprint in HER3 (see, e.g. Figure 9); and (3) the further limitation of 
this subset to those residues that are most likely exposed to the solvent environment and 
10 therefore likely to interact with heregulin. 

For example, certain residues were chosen for mutation based on their proximity 
to the proteolysis site in domain I of HER3 (see, e.g. Singer et al. (2001) / Biol Cbem 
276(47), 44266-74), conservation to equivalent residues which are critical for ligand 
binding in the IR (see, e.g. Williams et al. (1995) / Biol Cbem 270(7), 3012-6) and in the 
15 hormone binding footprint in the IGF-1R (see, e.g. Garrett et al. (1998) Nairn 
394(6691), 395-9) based on a multiple sequence alignment performed between HER3, IR 
and IGF-1R. As noted above, this subset of HER3 residues includes those shown in 
Tables 2 and 3. The specific results obtained with the examination of members of this 
subset of HER3 residues (e.g. VI 10 and E64) provides further evidence that the 
20 members of this specific subset of amino acid residues in the HER3 polypeptide are 
involved in the interaction with heregulin. 

The HER3 variants disclosed herein include those where the wild type amino 
acid residue at one of the enumerated positions is substituted with any one of the other 
19 naturally occurring amino acids. As is known in the art, there is a reasonable 
25 expectation that a substitution at a wild type residue identified as likely to control affinity 
(for example residue 110) will effect the affinity of the HER3 variant for heregulin. 
Specifically, the pertinent art teaches that in situations where an experimental analysis has 
established that the properties of a specific residue at a particular position within the 
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polypeptide chain are likely involved in maintaining some aspect of a protein's functional 
integrity, an alteration in the size, shape, charge, hydrogen-bonding capacity or chemical 
reactivity of the amino acid side chain at one of these "active" amino acid positions is 
. likely to affect the function of the protein in some way For this reason, .the skilled 
5 artisan would reasonably expect a perturbation in a HER3 residue likely to control the 
affinity of HER3 for heregulin to effect the affinity of HER3 for heregulin. 

The invention further provides recombinant DNA or RNA molecules containing a 
HER3 polynucleotide, fragment, variant thereof, including but not limited to phages, 
plasmids, phagernids, cosmids, YACs, BACs, as well as various viral and non-viral vectors 

10 well known in the art, and cells transformed or transfected with such recombinant DNA or 
RNA molecules. Methods for generating such molecules are well known (see, for example, 
Sambrook et al, 1989, supra). The invention further provides a host-vector system 
comprising a recombinant DNA molecule containing a HER3 polynucleotide, fragment, 
variant thereof within a suitable prokaryotic or eukaryotic host cell. Examples of suitable 

15 eukaryotic host cells include a yeast cell, a plant cell, or an animal cell, such as a 
m amma li a n cell or an insect cell (e.g., a baculovirus-infectible cell such as an Sf9 or 
HighFive cell). Examples of suitable mammalian ceEs include a number of mammalian 
cells routinely used for the expression of recombinant proteins (e.g., COS, CHO, 293, 
293T cells). More particularly, a polynucleotide comprising the coding sequence of 

20 HER3 or a fragment, variant thereof can be used to generate HER3 proteins or fragments 
thereof using any number of host-vector systems routinely used and widely known in the 
art. 

A wide range of host-vector systems suitable for the expression of HER3 proteins 
or fragments thereof are available, see for example, Sambrook et al., 1989, supra; Current 
25 Protocols in Molecular Biology, 1995, supra). Preferred vectors for mammalian 
expression include but are not limited to pcDNA 3.1 myc-His-tag (Invitrogen) and the 
retroviral vector pSRcctkneo (Muller et al., 1991, MCB 11:1785). The host-vector 
systems of the invention are useful for the production of a HER3 protein or fragment 
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thereof. Such host-vector systems can be employed to study the functional properties of 
HER3 and HER3 mutations or analogs. 

Recombinant human HER3 protein can be produced by mammalian cells 
transfected with a construct encoding a HER3 nucleotide. For example, 293T cells can 
5 be transfected with an expression plasmid encoding HER3 or fragment, variant thereof, 
the HER3 or related protein is expressed in the 293T cells, and the recombinant HER3 
protein is isolated, using standard purification methods (e.g., affinity purification using 
anti-HER3 antibodies). In another embodiment, a HER3 coding sequence is subcloned 
into the retroviral vector pSRocMSVtkneo and used to infect various mammalian cell 
10 lines, such as NIH 3T3, COS, CHO and MCF-7 in order to establish HER3 expressing 
cell lines. Various other expression systems well known in the art can also be employed. 
Expression constructs encoding a leader peptide joined in frame to the HER3 coding 
sequence can be used for the generation of a secreted form of recombinant HER3 
protein. 

15 As discussed herein, redundancy in the genetic code permits variation in HER3 

gene sequences. In particular, it is known in the art that specific host species often have 
specific codon preferences, and thus one can adapt the disclosed sequence as preferred 
for a desired host For example, preferred analog codon sequences typically have rare 
codons (i.e., codons having a usage frequency of less than about 20% in known 

20 sequences of the desired host) replaced with higher frequency codons. 

Additional sequence modifications are known to enhance protein expression in a 
cellular host. These include elimination of sequences encoding spurious polyadenylation 
signals, exon/intron splice site signals, transposon-like repeats, and/or other such well- 
characterized sequences that are deleterious to gene expression. The GC content of the 

25 sequence is adjusted to levels average for a given cellular host, as calculated by reference 
to known genes expressed in the host cell. Where possible, the sequence is modified to 
avoid predicted hairpin secondary mRNA structures. Other useful modifications include 
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the addition of a translational initiation consensus sequence at the start of the open 
reading frame, as described in Kozak, Mol. Cell Biol, 9:5073-5080 (1989). 

Embodiments of the invention disclosed herein include a wide variety of art- 
accepted variants of HER3 proteins such as polypep tides having amino acid insertions, 

5 deletions and substitutions. HER3 variants can be made using methods known in the art 
such as site-directed mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed 
mutagenesis (Carter et aL, NucL Acids Res., /5.4331 (1986); Zoller et aL, NucL Acids Rss., 
/0:6487 (1987)), cassette mutagenesis (Wells et aL, Gene, 34:315 (1985)), restriction 
selection mutagenesis (Wells et aL, Pbiios. Trans. & Soc. London SerA, 317:415 (1986)) or 

10 other known techniques can be performed on the cloned DNA to produce the HER3 
variant DNA. 

Scanning amino acid analysis can also be employed to identify one or more 
amino acids along a contiguous sequence that is involved in a specific biological activity 
such as a protein-protein interaction. Among the preferred scanning amino acids are 

15 relatively small, neutral amino acids. Such amino acids include alanine, glycine, serine, 
and cysteine. Alanine is typically a preferred scanning amino acid among this group 
because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the 
main-chain conformation of the variant Alanine is also typically preferred because it is 
the most common amino acid. Further, it is frequendy found in both buried and 

20 exposed positions (Creighton, The Ptvteins, (W.H. Freeman & Co., N.Y.); Chothia, J. MoL 
BioL, 150:1 (1976)). If alanine substitution does not yield adequate amounts of variant, 
an isosteric amino acid can be used. 

HER3 proteins can be generated using standard peptide synthesis technology or 
using chemical cleavage methods well known in the art. Alternatively, recombinant 

25 methods can be used to generate nucleic acid molecules that encode a HER3 protein. In 
one embodiment, nucleic acid molecules provide a means to generate defined fragments of 
the HER3 protein (or variants thereof). 
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Biol., 150:1 (1976)). If alanine substitution does not yield adequate amounts of variant, 
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HER3 proteins can.be generated using standard peptide synthesis technology or 
using chemical cleavage methods well known in the art Alternatively, recombinant 

25 methods can be used to generate nucleic acid molecules that encode a HER3 protein. In 
one embodiment, nucleic acid molecules provide a means to generate defined fragments of 
the HER3 protein (or variants thereof). 
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Modifications of HER3 proteins such as covalent modifications are included 
within the scope of this invention. One type of covalent modification includes reacting 
targeted amino acid residues of a HER3 polypeptide with an organic derivatizing agent 
that is capable of reacting with selected side chains or the N- or C- terminal residues of 
5 the HER3. Another type of covalent modification of the HER3 polypeptide included 
within the scope of this invention comprises altering the native glycosylation pattern of a 
protein of the invention. Another type of covalent modification of HER3 comprises 
linking the HER3 polypeptide to one of a variety of nonproteinaceous polymers, e.g., 
polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes, in the manner set 
10 forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 
4,179,337. 

B. ANTIBODIES 

It is contemplated that anti-HER3 receptor antibodies that specifically target the 
15 heregulin binding domain in HER3 may also be employed in the presendy disclosed 
methods. Examples of such molecules include neutralizing or blocking antibodies which 
can preferably inhibit binding of heregulin to HER3 receptors. The antt-HER3 
antibodies may be monoclonal antibodies. 

Monoclonal antibodies may be prepared using hybridoma methods, such as those 
20 described by Kohler and Milstein, Nature . 256 :495 (1975). In a hybridoma method, a 
mouse, . hamster, or other appropriate host animal, is typically immunized with an 
immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies 
that will specifically bind to the immunizing agent Alternatively, the lymphocytes may be 
immunized in vitro, 

25 The immunizing agent will typically include a HER3 polypeptide or a fusion 

protein thereof such as a HER3 ECD-IgG fusion protein. The immunizing agent may 
alternatively comprise a fragment or portion of HER3 having one or more amino acids diat 
participate in the binding of HER3 to heregulin. In a preferred embodiment, the 
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throughput screening of chemical libraries, making them particularly suitable for 
identifying small molecule drug candidates. 

The assays can be performed in a variety of formats, including protein-protein 
binding assays, biochemical screening assays, immunoassays, and cell-based assays, which 
5 are well characterized in the art. Assays for, for instance, antagonists are common in that 
they call for contacting the drug candidate with a ligand or receptor polypeptide 
identified herein under conditions and for a time sufficient to allow these two 
components to interact. 

In binding assays, the interaction is binding and the complex formed can be 

10 isolated or detected in the reaction mixture. In a particular embodiment, a preferred 
receptor polypeptide identified herein (e.g. HER3 polypeptide having only domains I and 
II) or the drug candidate is immobilized on a solid phase, e.g., on a micro titer plate, by 
covalent or non-covalent attachments. Non-covalent attachment generally is 
accomplished by coating the solid surface with a solution of the ligand or receptor 

15 polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal 
antibody, specific for the ligand or receptor polypeptide to be immobilized can be used 
to anchor it to a solid surface. The assay is performed by adding the non-immobilized 
component, which may be labeled by a detectable label, to the immobilized component, 
e.g., the coated surface containing the anchored' component When the reaction is 

20 complete, the non-reacted components are removed, e.g., by washing, and complexes 
anchored on the solid surface are detected. When the originally non-immobilized 
component carries a detectable label, the detection of label immobilized on the surface 
indicates that complexing occurred. Where the originally non-immobilized component 
does not carry a label, complexing can be detected, for example, by using a labeled 

25 antibody specifically binding the immobilized complex. 

If the candidate compound interacts with but does not bind to a particular ligand 
or receptor polypeptide identified herein, its interaction with that polypeptide can be 
assayed by methods well known for detecting protein-protein interactions. Such assays 
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immunizing agent comprises an extracellulat domain sequence of HER3 fused to an IgG 

V 

sequence. 

Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human 
origin are desired, or spleen cells or lymph node cells are used if non-human mammalian 
5 sources are desired. The lymphocytes are then fused with an immortalized cell line using a 
suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, 
Monoclonal Antibodies: Principles and Practice , Academic Press, (1986) pp. 59-103). 
Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells 
of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are 

10 employed The hybridoma cells may be cultured in a suitable culture medium that 
preferably contains one or more substances that inhibit the growth or survival of the 
unfused, immortalized cells. For example, if the parental cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium 
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT 

1 5 medium"), which substances prevent the growth of HGPRT-dencient cells. 

C. METHODS OF SCREENING MOLECULES THAT BIND TO THE 
HEREGULDSf BINDING SITE IN HER3 

20 The invention also encompasses methods of screening molecules to identify 

those which can act as agonists or antagonists of the heregulin/HER3 interaction. Such 
molecules may comprise small molecules or polypeptides, including antibodies. 
Examples of small molecules include, but are not limited to, small peptides or pep tide- 
like molecules, preferably soluble peptides, and synthetic non-peptidyl organic or 

25 inorganic compounds. The screening assays for drug candidates are designed to identify 
compounds or molecules that bind or complex with the ligand or receptor polypeptides 
identified herein, or otherwise interfere with the interaction of these polypeptides with 
other cellular proteins. Such screening assays will include assays amenable to high- 
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include traditional approaches, such as, e.g., cross-linking, co-irnmunoprecipitation, and 
co-purification through gradients or chromatographic columns. In addition, protein- 
protein interactions can be monitored by using a yeast-based genetic system described by 
Fields and co-workers (Fields and Song, Nature (London) , 340:245-246 (1989); Chien et 
5 al., Proc. Natl. Acad. Sci. USA T 88:9578-9582 (1991)) as disclosed by Chevrayand 
Nathans, Proc. Nad. Acad Sci. USA T 89: 5789-5793 (1991). Many transcriptional 
activators, such as yeast GAL4, consist of two physically discrete modular domains, one 
acting as the DNA-binding domain, the other one functioning as the transcription- 
activation domain. The yeast expression system described in the foregoing publications 

10 (generally referred to as the "two-hybrid system") takes advantage of this property, and 
employs two hybrid proteins, one in which the target protein is fused to the DNA- 
binding domain of GAL4, and another, in which candidate activating proteins are fused 
to the activation domain. The expression of a GALl-JacZ reporter gene under control of 
a GAL4-activated promoter depends on reconstitution of GAL4 activity via protein- 

15 protein interaction. Colonies containing interacting polypeptides are detected with a 
chromogenic substrate for P-galactosidase. A complete kit (MATCHMAKER™) for 
identifying protein-protein interactions between two specific proteins using the two- 
hybrid technique is commercially available from Clontech. This system can also be 
extended to map protein domains involved in specific protein interactions as well as to 

20 pinpoint amino acid residues that are crucial for these interactions. 

Compounds or molecules that interfere with the interaction of a ligand or 
receptor polypeptide identified herein and other intra- or extracellular components can 
be tested as follows: usually a reaction mixture is prepared containing the product of the 
gene and the intra- or extracellular component under conditions and for a time allowing 

25 for the interaction and binding of the two products- To test the ability of a candidate 
compound to inhibit binding, the reaction is run in the absence and in the presence of 
the test compound. In addition, a placebo may be added to a third reaction rnixture, to 
serve as positive control. The binding (complex formation) between the test compound 
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and the intra- or extracellular component present in the mixture is monitored as 
described hereinabove. The formation of a complex in the control reaction(s) but not in 
the reaction mixture containing the test compound indicates that the test compound 
interferes with the interaction of the test compound and its reaction partner. 
5 To assay for antagonists, the ligand or receptor polypeptide may be added to a 

cell along with the compound to be screened for a particular activity and the ability of 
the compound to inhibit the activity of interest in the presence of the ligand or receptor 
polypeptide indicates that the compound is an antagonist to the ligand or receptor 
polypeptide. Alternatively, antagonists may be detected by combining the ligand or 

10 receptor polypeptide and a potential antagonist with membrane-bound polypeptide 
receptors or recombinant receptors under appropriate conditions for a competitive 
inhibition assay. The ligand or receptor polypeptide can be labeled, such as by 
radioactivity, such that the number of polypeptide molecules bound to the receptor can 
be used to determine the effectiveness of the potential antagonist. The gene encoding 

15 the receptor can be identified by numerous methods known to those of skill in the art, 
for example, ligand panning and FACS sorting. Coligan et aL, Current Protocols in 
Immun. . 1(2): Chapter 5 (1991). Preferably, expression cloning is employed wherein 
polyadenylated RNA is prepared from a cell responsive to the ligand or receptor 
polypeptide and a cDNA library created from this RNA is divided into pools and used to 

20 transfect COS cells or other cells that are not responsive to the ligand or receptor 
polypeptide. Transfected cells that are grown on glass slides are exposed to labeled 
ligand or receptor polypeptide. The ligand or receptor polypeptide can be labeled by a 
variety of means including iodination or inclusion of a recognition site for a site-specific 
protein kinase. Following fixation and incubation, the slides are subjected to 

25 autoradiographic analysis. Positive pools are identified and sub-pools are prepared and 
re- transfected using an interactive sub-pooling and re-screening process, eventually 
yielding a single clone that encodes the putative receptor. 
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As an alternative approach, labeled ligand polypeptide can be photoaffinity- 
linked with cell membrane or extract preparations that express receptor molecule. Cross- 
. linked material is resolved by PAGE and exposed to X-ray film. The labeled complex 
containing the receptor can be excised, resolved into peptide fragments, and subjected to 

5 protein micro-sequencing. The amino acid sequence obtained from micro-sequencing 
would be used to design a set of degenerate oligonucleotide probes to screen a cDNA 
library to identify the gene encoding the putative receptor. 

A representative embodiment of this invention comprises a method of screening 
for a molecule that interacts with the heregulin binding domain in HER3 ("HER3 

10 heregulin binding domain") comprising the steps of contacting a population of molecules 
with a polypeptide comprising the HER3 heregulin binding domain, allowing the 
population of molecules and the HER3 heregulin binding domain to interact under 
conditions that facilitate an interaction, determining the presence of a molecule that 
interacts with the HER3 heregulin binding domain, and then separating molecules that 

15 do not interact with the HER3 heregulin binding domain from molecules that do. In a 
specific embodiment, the method further comprises purifying a molecule that interacts 
with the HER3 heregulin binding domain. The identified molecule can be used to 
modulate a function performed by HER3. This embodiment of the invention is well 
suited to screen chemical libraries for molecules which modulate, e.g., inhibit, antagonize, 

20 or agonize or mimic, the activity of HER3 as measured by one of the assays disclosed 
herein' or known in the art (see, e.g. Aigular et al., (1999) Oncogene 18, 6050-6062). The 
chemical libraries can be peptide libraries, peptidomimetic libraries, chemically 
synthesized libraries, recombinant, e.g., phage display libraries, and in vitro translation- 
based libraries, other non-pep tide synthetic organic libraries. 

25 Exemplary libraries are commercially available from several sources (ArQule, 

Tripos/PanLabs, ChemDesign, Pharmacopoeia). In some cases, these chemical libraries 
are generated using combinatorial strategies that encode the identity of each member of 
the library on a substrate to which the member compound is attached, thus allowing 



43 



WO 03/011897 



PCT/US02/23963 



direct and immediate identification of a molecule that is an effective modulator. Thus, in 
many combinatorial approaches, the position on a plate of a compound specifies that 
compound's composition. Also, in one example, a single plate position may have from 
1-20 chemicals that can be screened by adrriinistration to a well containing the 
5 interactions of interest Thus, if modulation is detected, smaller and smaller pools of 
interacting pairs can be assayed for the modulation activity. By such methods, many 
candidate molecules can be screened. 

Many diversity libraries suitable for use are known in the art and can be used to 
provide compounds to be tested according to the present invention. Alternatively, 

10 libraries can be constructed using standard methods. Chemical (synthetic) libraries, 
recombinant expression libraries, or polysome-based libraries are exemplary types of 
libraries that can be used. 

In one embodiment, one can screen peptide libraries to identify molecules that 
interact with HER3 heregulin binding domain protein sequences. In such methods, 

15 peptides that bind to the HER3 heregulin binding domain are identified by screening 
libraries that encode a random or controlled collection of amino acids. Peptides encoded 
by the libraries are expressed as fusion proteins of bacteriophage coat proteins, the 
bacteriophage particles are then screened against the protein of interest. 

Accordingly, peptides having a wide variety of uses, such as therapeutic, 

20 prognostic or diagnostic reagents, are thus identified without any prior information on 
the structure of the expected ligand or receptor molecule. Typical peptide libraries and 
screening methods that can be used to identify molecules that interact with HER3 
heregulin binding domain sequences are disclosed for example in U.S. Patent Nos. 
5,723,286 issued 3 March 1998 and 5,733,731 issued 31 March 1998. 

25 Small molecules and ligands that interact with HER3 heregulin binding domain 

can be identified through related embodiments of such screening assays. For example, 
small molecules can be identified that interfere with protein function, including 
molecules that interfere with the tyrosine kinase activity associated with HER3. 
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A typical embodiment is a method of identifying a compound which specifically 
binds the HER3 heregulin binding domain, comprising the steps of contacting the HER3 
heregulin binding domain with a test compound under conditions favorable to binding; 
and then deterrnining whether said test compound binds to said HER3 heregulin binding 
5 do ma in so that a compound which binds to said domain can be identified. As the 
interaction between various receptor tyrosine kinases and a variety of test compounds 
have been previously described, skilled artisans are familiar with the conditions 
conducive to binding. A specific embodiment of this aspect of the invention includes 
the steps of transfecting cells with a construct encoding HER3, contacting said cells with 

10 said test compound that is tagged or labelled with a detectable marker and then analyzing 
said cells for the presence bound test compound. In contexts where the ttansfected cells 
are observed to preferentially bind the test compound as compared to cells that have not 
been transfected with a HER3 construct, this indicates that the test compounds is 
binding to the HER3 protein expressed by those cells. 

15 A test compound which binds said HER3 heregulin binding domain may then be 

further screened for the inhibition of a biological activity (e.g. tyrosine kinase activity) 
associated with HER3. Such an embodiment includes, for example determining whether 
said test compound inhibits die tyrosine kinase activity associated with HER3 by utilizing 
molecular biological protocols to create recombinant contracts whose enzymological and 

20 biological properties can be examined direcdy. Enzymology is performed for example, 
by measuring tyrosine kinase activity in vitro or in HER3 expressing cells using standard 
assays. 

D. ASSAY METHODS 
25 Diagnostic methods are also provided herein. For instance, the antagonists or 

agonists may be employed to detect HER3 and/or heregulin in mammals known to be 
or suspected of having a HER3 and/or heregulin related pathological condition. The 
antagonist or agonist molecule may be used, e.g., in immunoassays to detect or quantitate 
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heregulin and/ or HER3 in a sample. A- sample, such as cells obtained from a mammal, 
can be incubated in the presence of a labeled antagonist or agonist molecule, and 
detection of the labeled antagonist or agonist bound in the sample can be performed. 
Such assays, including various clinical assay procedures, are known in the art, for instance 
5 as described in Voller et aL, Immunoassays , University Park, 1981. 

Ligand/receptor binding studies may be carried out in any known assay method, 
such as competitive binding assays, direct and indirect sandwich assays, and 
irnmunoprecipitation assays. Cell-based assays and animal models can be used as 
diagnostic methods and to further understand the interaction between the ligands and 

10 receptors identified herein and the development and pathogenesis of the conditions and 
diseases referred to herein. 

In one approach, mammalian cells may be transfected with the ligands or 
receptors described herein, and the ability of the agonists or antagonists to stimulate or 
inhibit binding or activity is analyzed. Suitable cells can be transfected with the desired 

15 gene, and monitored for activity. Such transfected cell lines can then be used to test the 
ability of antagonist^) or agonist(s) to inhibit or stimulate, for example, to modulate B- 
cell proliferation or Ig secretion. Cells transfected with the coding sequence of the genes 
identified herein can further be used to identify drug candidates for the treatment of 
immune related diseases or cancer. 

20 In addition, primary cultures derived from transgenic animals can be used in the 

cell-based assays. Techniques to derive continuous cell lines from transgenic animals are 
well known in the art (see, e.g., Small et aL, MoL CelL Biol . 5:642-648 (1985)). 

E. FORMULATIONS 

25 The HER3 antagonists or agonists described herein, are optionally employed in 

a carrier. Suitable carriers and their formulations are described in Remington's 
Pharmaceutical Sciences. 16th ed, 1980, Mack Publishing Co., edited by Osol et aL 
Typically, an appropriate amount of a pharmaceutically-acceptable salt is used in the carrier 
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to render the formulation isotonic. Examples of the carrier include saline, Ringer's solution 
and dextrose solution. The pH of the carrier is preferably from about 5 to about 8, and 
more preferably from about 7.4 to about 7.8. It will be apparent to those persons skilled in 
the art that certain carriers may be more preferable depending upon, for instance, the route 
5 of administration and concentration of active agent being administered. The carrier may be 
in the form of a lyophilized formulation or aqueous solution. 

Acceptable carriers, excipients, or stabilizers are preferably nontoxic to cells 
and/ or recipients at the dosages and concentrations employed, and include buffers such 
as phosphate, citrate, and other organic acids; antioxidants including ascorbic acid and 

10 methionine; preservatives (such as octadecyldimethylbenzyl ammonium chloride; 
hexamethonium chloride; benzalkonium chloride, benzethonium chloride; phenol, butyl 
or benzyl alcohol; alkyl parabens such as methyl or propyl paraben; catechol; resorcinol; 
cyclohexanol; 3-pentanol; and m-cresol); low molecular weight (less than about 10 
residues) polypeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; 

15 hydrophihc polymers such as polyvinylpyrrolidone; amino acids such as glycine, 
glutamine, asparagbe, histidine, arginine, or lysine; monosaccharides, disaccharides, and 
other carbohydrates including glucose, mannose, or dextrins; chelating agents such as 
EDTA; sugars such as sucrose, mannitol, trehalose or sorbitol; salt-forming counter-ions 
such as sodium; and/or non-ionic surfactants such as TWEEN™, PLURONICS™ or 

20 polyethylene glycol (PEG). 

The formulation may also contain more than one active compound as necessary 
for the particular indication being treated, preferably those with complementary activities 
that do not adversely affect each other. 

The HER3 antagonist or agonist described herein, may also be entrapped in 

25 microcapsules prepared, for example, by coacervation techniques or by interfacial 
polymerization, for example, hydroxymethylcellulose or gelatin-microcapsules and poly- 
(methylmethacylate) microcapsules, respectively, in colloidal drug delivery systems (for 
example, liposomes, dbumin microspheres, microemulsions, nano-particles and 
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nanocapsules) or in macroemulsions. Such techniques are disclosed in Remington's 
. Pharmaceutical Sciences 16th edition, Osol, A. Ed. (1980). 

The formulations to be used for in vivo administration should be sterile. This is 
readily accomplished by filtration through sterile nitration membranes. Sustained-release 
5 preparations may be prepared. Suitable examples of sustained-release preparations 
include semipermeable matrices of solid hydrophobic polymers containing the active 
agent, which matrices are in the form of shaped articles, e.g. films, or microcapsules. 
Examples of sustained-release matrices include polyesters, hydrogels (for example, 
poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides (U.S. Pat. No. 

10 3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable 
ethylene-vinyl acetate, degradable lactic acid-glycolic acid copolymers such as the 
LUPRON DEPOT™ (injectable microspheres composed of lactic acid-glycolic acid 
copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid. While 
polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release of 

15 molecules for over 100 days, certain hydrogels release proteins for shorter time periods. 

Related embodiments of the invention include methods for the preparation of a 
medication for the treatment of pathological conditions including cancer by preparing a 
HER3 variant polypeptide (e.g. a variant having a substitution at V110) composition for 
ackninistration to a mammal having the pathological condition. A related method is the 

20 use of an effective amount of a HER3 variant polypeptide in the preparation of a 
medicament for the treatment of cancer. Such' methods typically involve the steps of 
including an amount of HER3 variant polypeptide sufficient to inhibit the interaction of 
heregulin and HER3 in vivo and an appropriate amount of a physiologically acceptable 
earner. As is known in the art, optionally other agents can be included in these 

25 preparations. 
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F. MODES OF THERAPY 

The invention further provides methods for modulating HER3, and/or heregulin 
activity in mammalian cells which comprise exposing the cells to a desired amount of 
antagonist or agonist that affects heregulin interaction with HER3. Preferably, the amount 
5 of antagonist or agonist employed will be an amount effective to affect the binding and/ or 
activity of the respective ligand or respective receptor to achieve a therapeutic effect. This 
can be accomplished in vivo or ex vivo in accordance, for instance, with the methods 
described below. Exemplary conditions or disorders to be treated with HER3 agonists or 
heregulin agonists include cancer such as those associated with an aberrant expression of 

10 HER3 and/or heregulin. 

The molecules described herein are useful in treating various pathological 
conditions, such as cancer. These conditions can be treated by stimulating or inhibiting a 
selected activity associated with heregulin or HER3 in a mammal through, for example, 
administration of one or more antagonists or agonists described herein. 

15 Diagnosis in mammals of the various pathological conditions described herein 

can be made by the skilled practitioner. Diagnostic techniques are available in the art 
which allow, e.g., for the diagnosis or detection of cancer or immune related disease in a 
mammal. For instance, cancers may be identified through techniques, including but not 
limited to, palpation, blood analysis, x-ray, NMR and the like. 

20 The antagonists) or agonist(s) can be administered in accord with known 

methods, such as intravenous administration as a bolus or by continuous infusion over a 
period of time, by intramuscular, intraperitoneal, intracerebrbspinal, subcutaneous, intra- 
articular, intrasynovial, intrathecal, oral, topical, or inhalation routes. Optionally, 
adrninistration may be performed through mini-pump infusion using various 

25 commercially available devices. The antagonists or agonists may also be employed using 
gene therapy techniques which have been described in the art. Effective dosages and 
schedules for administering antagonists or agonists may be determined empirically, and 
making such determinations is within the skill in the art Single or multiple dosages may be 
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employed. It is presently believed that an effective dosage or amount of antagonist or 
agonist used alone may range from about 1 ng/kg to about 100 zag/kg of body weight or 
more per day. Interspecies scaling of dosages can be performed in a manner known in the 
art, e,g., as disclosed in Mordenti et al., Pharmaceut Res. T 8:1351 (1991). 
5 As noted above, the HER3 polypeptides, HER3 polypeptide variants, HER3 

polypeptide fragments, HER3 polynucleotides encoding said polypeptides, variants and 
fragments, and the HER3 agents useful in the methods of the invention can be 
incorporated into pharmaceutical compositions suitable for administration into a 
mammal. Such compositions typically comprise at least one HER3 polypeptide, HER3 

10 polypeptide variant, HER3 polypeptide fragment, HER3 polynucleotide encoding said 
polypeptide, variant or fragment, an HER3 agent, or a combination thereof, and a 
pharmaceutically acceptable carrier. Methods for formulating the HER3 compounds of 
the invention for pharmaceutical administration are known to those of skill in the art 
See, for example, Remington: The Science and Practice of Pharmacy, 19 th Edition, 

15 Gennaro (ed.) 1995, Mack Publishing Company, Easton, PA. 

As used herein the language "pharmaceutically acceptable carrier" is intended to 
include any and all solvents, dispersion media, coatings, antibacterial and antifungal 
agents, isotonic and absorption delaying agents, and the like, compatible with 
pharmaceutical administration. The use of such media and agents for pharmaceutically 

20 active substances is well known in the art Except insofar as any conventional media or 
agent is incompatible with the active compound, such media can be used in the 
compositions of the invention. Supplementary active compounds can also be 
incorporated into the compositions. A pharmaceutical composition of the, invention is 
formulated to be compatible with its intended route of administration. 

25 The pharmaceutical compositions of the invention, comprising HER3 

polypeptides, HER3 polypeptide variants, HER3 polypeptide fragments, polynucleotides 
encoding said HER3 polypeptides, variants and fragments, as well as HER3 agents, as 
denned above, are administered in therapeutically effective amounts. The 
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"therapeutically effective amount" refers to a nontoxic dosage level sufficient to induce a 
desired biological result (e.g. a diminution of the severity of the symptoms associated 
with a pathological condition such as breast or ovarian cancer). Amounts for 
administration may vary based upon the desired activity, the diseased state of the 
5 mammal being treated, the dosage form, method of administration, patient factors such 
as age, sex, and severity of disease. It is recognized that a therapeutically effective 
amount is provided in a broad range of concentrations. Such range can be determined 
based on in vitro and/ or in vivo assays. 

Therapeutic compositions of the HER3 can be prepared by mixing the desired 

10 HER3 molecule having the appropriate degree of purity with optional pharmaceuticaUy 
acceptable carriers, excipients, or stabilizers (Remington's Pharmaceutical Sciences , 16th 
edition, Osol, A. ed. (1980)), in the form of lyophilized formulations, aqueous solutions 
or aqueous suspensions. Acceptable carriers, excipients, or stabilizers are preferably 
nontoxic to recipients at the dosages and concentrations employed, and include buffers 

15 such as Tris, HEPES, PIPES, phosphate, citrate, and other organic acids; antioxidants 
including ascorbic acid and methionine; preservatives (such as octadecyldimethylbenzyl 
ammonium chloride; hexamethonium chloride; benzalkonium chloride, benzethonium 
chloride; phenol, butyl or benzyl alcohol; alkyl parabens such as methyl or propyl 
paraben; catechol; resorcinol; cyclohexanol; 3-pentanol; and m-cresol); low molecular 

20 weight (less than about 10 residues) polypeptides; proteins, such as serum ?Ibnynin # 
gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino 
acids such as glycine, glutamine, asparagine, histidine, arginine, or lysine; 
monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or 
dextrins; sugars such as sucrose, mannitol, trehalose or sorbitol; salt-forming counter- 

25 ions such as sodium; and/or non-ionic surfactants such as TWEEN™, PLURONICS™ 
or polyethylene glycol (PEG). 

Additional examples of such carriers include ion exchangers, durnina, aluminum 
stearate, ledthin, serum proteins, such as human serum albumin, buffer substances such 
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as glycine, sorbic acid, potassium soibate, partial glyceride mixtures of saturated 
vegetable fatty acids, water, salts, or electrolytes such as protamine sulfate, disodium 
hydrogen phosphate, potassium hydrogen phosphate, sodium chloride, colloidal silica, 
magnesium trisilicate, polyvinyl pyrrolidone, and cellulose-based substances. Carriers for 
5 topical or gel-based forms include polysaccharides such as sodium 
carboxymethylcellulose or methylcellulose, polyvinylpyrrolidone, polyacrylates, 
polyoxyethylene-polyoxypropylene-block polymers, polyethylene glycol, and wood wax 
alcohols. For all administrations, conventional depot forms are suitably used. Such 
forms include, for example, microcapsules, nano-capsules, liposomes, plasters, inhalation 

10 forms, nose sprays, sublingual tablets, and sustained-release preparations. 

Solutions or suspensions used fot adrriinistering HER3 can include the following 
components: a sterile diluent such as water for injection, saline solution; fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial 
agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or 

15 sodium bisulfite; chelating agents such as EDTA; buffers such as acetates, citrates or 
phosphates and agents for the adjustment of tonicity such as sodium chloride or 
dextrose. In one embodiment, a pharmaceutical composition can be delivered via slow 
release formulation or matrix comprising HER3 protein or DNA constructs suitable for 
expression of HER3 protein into or around a site within the body. 

20 HER3 can also be administered in the form of a variety of sustained-release 

preparations. For example, HER3 may be delivered to the lung for slow release via 
encapsulation or carrier materials such as liposomes, or other drug "shells" such as 
dburnin (Albunex by Molecular Biosystems), sugars (Levovist by Schering), gelatins, or 
lipids. Other suitable examples of sustained-release preparations for use with 

25 polypeptides including semipermeable matrices of solid hydrophobic polymers 
containing the protein, which matrices are in the form of shaped articles, e.g., films, or 
microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (e.g., 
poly(2-hydroxyethyl-methacrylate) as described by Langer et al, J. Biomed. Mater. Res. . 
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15: 167-277 (1981) and Langer, Chem Tech. . 12: 98-105 (1982) or poly(vinyklcohol)), 
polylactides (U.S. Patent No. 3,773,919, EP 58,481), copolymers of ^glutamic acid and 
gamma ethyl-L-glutamate (Sidman et al, Biopolymers T 22: 547-556 (1983)), non- 
degradable ethylene-vinyl acetate (Langer et al t supra), degradable lactic acid-glycolic acid 
5 copolymers such as the Lupron Depot (injectable microspheres composed of lactic acid- 
glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid 
(EP 133,988). 

The route of administration may vary depending on the desired effect and/or 
♦ 

outcome. Generally for initiation of an HER3 mediated response, introduction of the 

10 HER3 at or near the desired site of response is utilized. Alternatively additional routes of 
administration, such as a systemic administration of HER3, may be employed. Examples 
of routes of administration include parenteral, e.g., intravenous, intradermal, 
intramuscular, subcutaneous, oral (e.g., inhalation) transdermal (topical), transmucosal 
(e.g. a nasal spray), and rectal administration. The HER3 polypeptide, may also be 

15 a dmini stered by perfusion techniques, such as isolated tissue perfusion, to exert local 
therapeutic effects. Solutions or suspensions used for parenteral, intradermal, or 
subcutaneous application can include the following components: a sterile diluent such as 
water for injection, saline solution; fixed oils, polyethylene glycols, glycerine, propylene 
glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or methyl 

20 parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such as 
EDTA; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. Regimens of administration may vary. A 
single dose or multiple doses of the agent may be used. Such regimens can vary 
depending on the severity of the disease and the desired outcome. Following 

25 administration of a HER3 polypeptide to the mammal, the mammal's physiological 
condition can be monitored in various ways well known to the skilled practitioner familiar 
with the pathological condition to be treated (e.g. breast or ovarian cancer). 
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It is contemplated that yet additional therapies may be employed in the methods. 
The one or more other therapies may include but are not limited to, administration of 
radiation therapy, cytokine(s), growth inhibitory agent(s), chemotherapeutic agent(s), 
cytotoxic agent(s), tyrosine kinase inhibitors, ras famesyl transferase inhibitors, angiogenesis 
5 inhibitors, and cyclin-dependent kinase inhibitors which are known in the art and defined 
further with particularity in Section I above. In addition, therapies based on therapeutic 
antibodies that target tumor antigens such as Rituxan™ or Herceptin™ as well as anti- 
angiogenic antibodies such as anti-VEGF. 

Preparation and dosing schedules for chemotherapeutic agents may be used 

10 according to manufacturers' instructions or as determined empirically by the skilled 
practitioner. Preparation and dosing schedules for such chemotherapy are also described 
in Chemotherapy Service Ed., M.C. Perry, Williams & Wilkins, Baltimore, MD (1992). The 
chemotherapeutic agent may precede, or follow administration of, e.g. an antagonist, or 
may be given simultaneously therewith. The antagonist, for instance, may also be 

15 combined with an anti-oestrogen compound such as tamoxifen or an anti-progesterone 
such as onapristone (see, EP 616812) in dosages known for such molecules. 

G. ARTICLES OF MANUFACTURE 

In another embodiment of the invention, an article of manufacture containing 
20 materials useful for the treatment of the disorders described above is provided. The 
article of manufacture comprises a container and a label. Suitable containers include, for 
example, bottles, vials, syringes, and test tubes. The containers may be formed from 
a variety of materials such as glass or plastic. The container holds a composition which is 
effective for treating the condition and may have a sterile access port (for example the 
25 container may be an intravenous solution bag or a vial having a stopper pierceable by a 
hypodermic injection needle). The active agents in the composition may comprise 
antagonist^) or agonist(s). The label on, or associated with, the container indicates that 
the composition is used for treating the condition of choice. The article of manufacture 
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may further comprise a second container comprising a pharmaceutically-acceptable 
buffer, such as phosphate-buffered saline, Ringer's solution and dextrose solution. It 
may further include other materials desirable from a commercial and user standpoint, 
including other buffers, diluents, filters, needles, syringes, and package inserts with 
5 instructions for use. 

H. BRIEF CHARACTERIZATION OF ASPECTS OF THE INVENTION 

As disclosed herein, the predominant ligand binding site for heregulin is located 

10 in domain I of HER3. The disclosure identifies two proteolytic sites in HER3-ECD IIV 
at positions 50 and 270. Cleavage at position 50 is fully protected by hrg binding, 
whereas position 270 is unprotected. Hrg protection at position 50 is a result of the 
specific interaction between hrg and HERS-ECD 1 *™. Only two fragments were detected 
by SDS-PAGE and mass spectrometry. The N-terminal fragment generated from 

15 cleavage at residue 50 could not be detected, but its absence could be due to additional 
cleavage sites between residues 20 and 50. The cleavage site at position 270 is not 
protected by hrg, but the fragments generated by this cleavage are held together by 
disulfide bonds. Hrg protects only one of the cleavage sites in a ligand-specific manner. 
The extreme C terminus is also sensitive to proteolysis, which could be blocked by an 

20 antibody against the V-5 epitope tag but not by hrg. 

This protection at position 50 can be explained in two different ways, which 
include binding of hrg at or near position 50 or indirect protection resulting from a 
conformational change in HER3-ECD I_rv , induced by hrg binding. In order to 
distinguish between these two possibilities based on the results of the proteolysis 

25 experiments; we expressed domain I and II of HER3-ECD I4V (HER3-ECD 111 ) and 
assayed it for binding. 

As illustrated in the Examples below, HER3-ECD I n polypeptide is able to bind 
heregulin and the association state and binding properties of HER3-ECD I_n were 
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characterized. For example, HER3-ECD 1 ' 11 is shown to be a monomer and remains in a 
monomeric state when hrg is present This is in contrast to the HER3-ECD IIV 
expressed in S2 cells in which hrg reverses oligomerization to form a monomer (see, e.g. 
Landgraf, R., and Eisenberg, D. (2000) Biochemistry 39, 8503-8511). As discussed 
5 below we analyzed direct hrg binding to HER3-ECD 1-11 and qualitatively demonstrated 
specific binding to hrg binding by a pull-down assay. An equilibrium dissociation 
constant of 68 nM for hrg binding was calculated by surface plasmon resonance in which 
the trx-hrg was immobilized. HER3-ECD 1 ' 11 has 30-fold lower binding affinity than 
HER3-ECD MV (2.3 nM) but a 7-fold greater affinity for its ligand than the extracellular 

10 domain of EGFR (500 nM) (see, e.g. Kohda et aL, (1993) J. Biol. Chem. 268, 1976-1981). 
Consequendy the results demonstrate that domains I and II of HER3-ECD are sufficient 
for hrg binding, and the results from proteolysis protection provide evidence that a 
ligand binding site is located in domain I. 

Limited proteolysis of HER3-ECD WV and the expression of HER3-ECD in 

15 provides evidence that the extent to which different domains of the type I receptor 
tyrosine kinases (EGFR, HER2, HER3, and HER4) contribute to ligand binding may 
not be conserved among die members of this EGFR family. The data provides evidence 
that domain I of HER3-ECD MV contains a site involved in ligand binding, whereas in 
EGFR, multiple lines of evidence provide evidence that the ligand contacts both domain 

20 I and III (see, e.g. Lax et aL, (1989) EMBO J. 8, 421-427; Lax et aL, (1991) Cell ReguL 2, 
337-345; Wu et aL, (1989) J. BioL Chem. 264, 17469-17475; Summerfield et aL, (1996) J. 
Biol. Chem. 271, 19656-19659; Wu et aL, (1990) Proc. Natl. Acad. ScL U. S. A. 87, 3151- 
3155) but that domain III alone is sufficient for ligand binding (see, e.g. Kohda et aL, 
(1993) J. BioL Chem. 268, 1976-1981). It is noteworthy that the Kd for transforming 

25 growth factor binding to the monomeric ECD of EGFR is 500 nM. Domain III of 
EGFR binds transforming growth factor with a Kd of 1.3 uM. The Kd of HER3-ECD I " rv 
is 1.9 nM, whereas the Kd for HER3-ECD I U is 68 nM. Therefore, despite a loss of 
binding compared with HER3-ECD irv , HER3-ECD I U still retains a higher affinity for 
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its ligand than the EGFR ECD. EGFR forms dimers in response to ligand binding (see, 
e.g. Hurwitz et al., (1991) J. Biol. Chem. 266, 22035-22043), whereas HER3-ECDWV 
expressed in S2 cells exists as an oligomer but, following ligand binding, forms a 
monomer (see, e.g. Landgraf, R., and Eisenberg, D. (2000) Biochemistry 39, 8503-8511). 
5 This may be a reflection of the different modes of ligand binding behavior in both 
receptors. The data provides evidence that for HER3-ECD I " rv ligand binding occurs 
pardy in domain I, whereas domain III may also contribute to binding; the relative 
contribution of domains I and II to binding appears to be shifted in favor of domain I, 
for HER3, 

10 An understanding of the activity of type I RTKs has been hindered by the lack of 

a molecular structure for any member of the family. It has been proposed that the 
structure of type I RTK ECDs may be similar to the insulin growth factor-1 receptor 
(IGF-1R) (see, e.g. Jorissen et al., (2000) Protein Sci. 9, 310-324; Garrett et al., (1998) 
Nature 394, 395-399). The structure of the first three domains of IGF-1R was solved by 

15 x-ray crystallography at 2.6-A resolution (see, e.g. Garrett et aL, (1998) Nature 394, 395- 
399). IGF-1R is a type II receptor tyrosine kinase and is a member of the tyrosine kinase 
superfamily, which includes the type I (EGFR/HER3) subfamily (see, e.g. Ullrich, A., 
and Schlessinger, J. (1990) Cell 61, 203-212). HER3 and IGF-1R have significant 
sequence identity in portions of the extracellular domain and have a similar domain 

20 orgarnzation (see, e.g. Yarden, Y., and Ullrich, A. (1988) Annu. Rev. Biochem. 57, 443- 
478; Lax et al., (1988) MoL Cell. Biol. 8, 1970-1978). In addition, this equivalence has 
been proposed based on homology modeling between HER3-ECD IIV and the IGF-1R 
structure (see, e.g. Jorissen et aL, (2000) Protein Sci. 9, 310-324). The IGF-1R structure 
contains three domains (LI, SI, and L2), which are equivalent to domains I, II, and HI in 

25 HER3-ECD wv (Table I), Domains LI and L2 are similar in sequence in IGF-1R (25% 
identity and 41% similarity), as are domains I and 111 in HER3-ECD I IV (30% identity 
and 41% similarity). Domains LI and L2 also have highly similar structures (see, e.g. 
Garrett et al., (1998) Nature 394, 395-399). Alanine-scanning mutagenesis of the insulin 
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receptor, a receptor closely related to IGF-1R, identified four regions in the primary 
sequence of LI that were important for ligand binding (see, e.g. Williams et al., (1995) J. 
Biol. Chem. 270, 3012-3016)/ These regions map to the IGF-1R structure, which reveals 
a "hormone binding footprint" lying on the face of domain LI (see, e.g. Garrett et aL, 
5 (1998) Nature 394, 395-399). A naturally occurring mutation in the insulin receptor at 
residue 58 lies within the footprint (see, e.g. van der Vorm et aL, (1992) J. BioL Chem. 
267, 66-71). Other mutations in the insulin receptor also provide evidence minor 
contributions in domain L2 (see, e.g. Nakae et al., (1995). J. BioL Chem. 270, 22017- 
22022). A model has been suggested in which ligand binding in the IGF-1R involves 

10 domains LI and L2 (see, e.g. Jorissen et aL, (2000) Protein Sci. 9, 310-324), which 
correspond to domains I and III in HER3-ECD I ~ rv . 

Our current analysis provides biochemical evidence for the proposed structural 
similarity of type I (HER3) and II (IGF-1R and IR) RTKs. First, a sequence alignment 
of domain LI in the IR and the IGF-1R and domain I in HEItf-ECD 1 -™ was performed 

15 (Fig. 8). The cleavage site at position 50, which is protected by hrg corresponds to the 
putative ligand-binding region in domain LI in IGF-1R (Fig. 9 A). Residue 17 in IGF-1R 
(position 50 in HEI^-ECD 1 " 1 ^ lies on the same face as the binding footprint and is 11 A 
away from residue 10, which lies within the binding footprint Second, we expressed 
HER3-ECD 111 , which binds hrg with a 68 nM dissociation constant, supporting the 

20 involvement of domain I in ligand binding. Third, the second proteolysis site in which 
the fragments are held together by a disulfide bridge at position 270 is located in the 
cysteine-rich domain II of HER3-ECD irv (position 237 in the IGF-1R). Based on the 
IGF-1R structure, cleavage at this position is expected to result in a disulfide-linked 
fragment (Fig. 9B). Fourth, the large accessible surface area (1782 A 2 ). of domain LI that 

25 is buried by domain SI of IGF-1R and the conserved contacts in this interface in the 
type 1 RTKs (see, e.g. Jorissen et al., (2000) Protein Sci. 9, 310-324) provide evidence 
that these domains function as unit This is consistent with our finding that the 
expression of domain I of HER3 requires the presence of domain II. Therefore, our 
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results are consistent with the proposal that HER3 has a structure similar to IGF-1R. 
These data provide further insight into areas in the HER3 receptor critical for heregulin 
binding. 

The following examples are offered by way of illustration and not by way of 
5 limitation. Various citations are referenced throughout the specification (e.g. PCT 
application No. WO0160397A1). The disclosures of all citations in the specification are 
expressly incorporated herein by reference. 

EXAMPLES 

10 Example 1 : Common Materials 

Soluble extracellular domains of recombinant HER3 (HER3-ECD I ' rv ) were 
purified from Chinese hamster ovary cells by the procedure of Kita et al (see, e.g. Kita et 
aL, (1996) Biochem. Biophys. Res. Commun. 226, 59-69). The 60-residue EGF-like 
domain of human heregulin (51 (residues 177-237) (hrg), the binding-deficient heregulin 

15 mutant (NAs-hrg), and the thioredoxin-heregulin fusion with a C-terminal His and S-tag 
(trx-hrg) were generated and purified as described elsewhere (see, e.g. Landgraf et al, 
(1998) Biochemistry 37, 3220-3228). EGF was purchased from Sigma. Trypsin was 
purchased from Life Technologies, Inc. (catalog number 15400-013 and lot numbers 
12P6334 and 1023241) and Sigma (catalog number T-1426 and lot number 70K7661), 

20 which had been treated with tosylamide-2-phenylethylchloromethyl ketone to remove 
chymo trypsin. 
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Example 2: Proteolytic Examination of Heregulin and HER3 Interaction 

A. Proteolytic Digestion. 

5.0 ul of (HER3-ECD IIV ) at 2.3 mg/ml in storage buffer (20 mM sodium acetate 
pH 5.5 and 40 mM sodium chloride), 10 ul of phosphate-buffered saline (PBS): 136 mM 
5 NaCl, 2.68 mM KC1, 9.289 mM Na2HP0 4 , 1.969 mM KH 2 P0 4 ), and 3.0 ul of trypsin 
(Life Technologies, Inc.; catalog number 15400-013 and lot number 12P6334) at 5.0 
mg/ml in 5.0 mM EDTA and 150 mM sodium chloride were incubated at 37 °C for 1 h. 
Digestion was stopped by adding phenylmethylsulfonyl fluoride at a final concentration 
of 1.0 mM. 5.0 ul of the digested sample were added to 5.0 ul of 2x SDS-PAGE sample 

10 buffer with and without 1.0 mM P-mercaptoethanol ((3-ME). These samples were then 
analyzed by SDS-PAGE on a 10-15% Phast gel (Amersham Pharmacia Biotech) and 
visualized by Coomassie staining. Different sources of trypsin were compared, but the 
best results were obtained with tissue culture grade trypsin from Life Technologies, Inc., 
which may contain other proteases. 

15 HER3-ECD I_Iv was titrated with hrg to determine the rninimum amount required 

for complete protection of HER3-ECD I_IV from proteolysis. Hrg in PBS was incubated 
with HER3-ECD IIV for 30 min at room temperature, followed by trypsin as described 
above. The final concentration of hrg in the digest ranged from 1.7 to 17 uM with a 
constant HERS-ECD 1 " 17 concentration of 7.7 uM. These reaction mixtures were 

20 analyzed by reducing SDS-PAGE as described above. The proteolytic digest was 
repeated with NAs-hrg and EGF using a molar ratio of 1:1. 

B. Proteolysis of HER3-ECD IIV Results in Two Primary Fragments That Remain 
Associated under Oxidizing Conditions. 

First, we digested HEItf-ECD 1 -^ to learn if limited proteolysis ;leads to a 
25 fragment still capable of binding hrg. This results in two principal fragments (Fig. 1 A, 
lane 3) with apparent molecular masses of 24 kDa (fragment 1) and 54 kDa (fragment 2), 
compared with an apparent molecular mass for the nonproteolyzed HER3-ECD i rv of 
80 kDa. Gel electrophoresis of the proteolyzed HER3-ECD I IV in the absence of a 
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reducing agent shows that the two fragments remain linked by disulfide bonds (Fig. 1A, 
lane 4). Fragment 2 was identified as the C-terminal fragment by Western blot using the 
V5 C-terminal epitope-tagged HER3-ECD I " IV (Fig. IB). 

5 C Proteolytic Cleavage of HER3-ECD I_IV Abolishes Heregulin Binding 

To determine the effects of cleavage of HER3-ECD IIV on its ability to bind hrg, 
a gel mobility shift assay was performed In a native gel analysis, proteolyzed HER3- 
ECD 1 "™ (Fig. 2A, lane 2) shows an electrophoretic mobility similar to the 
nonproteolyzed HER3-ECD I IV (Fig. 2A, lane 1). The nonproteolyzed HER3-ECD IIV 

10 shows a discrete shift in the presence of a 1:1 molar ratio of hrg to HEIO-ECD 1 -™ (Fig. 
2A, lane 3). The proteolyzed HER3-ECD IIV does not shift its gel position in the 
presence of hrg (Fig. 2A, lane 4). This finding provide evidences that the two disulfide- 
linked fragments, generated by proteolytic digestion (Fig. 1A, lane 4) do not bind hrg. 

To check whether the proteolyzed HER3-ECD irv binds hrg, we used a pull- 

15 down assay under physiological salt conditions. In this analysis, we measured the HER3- 
ECD 1 '™ and proteolyzed HEI^-ECD 1 "^ that could be dissociated from immobilized 
thioredoxin-heregulin (trx-hrg) by the addition of hrg. Using trx-hrg-coupled S-protein 
agarose, HER3-ECD I " IV was eluted with hrg (Fig. 2B, lane 3). No proteolyzed HER3- 
ECD IIV could be recovered by either elution with hrg (Fig. 2B, lane 4) or detergent (Fig 

20 2B, lane 7). However, HER3-ECD I IV that was protected by hrg during proteolytic 
digestion could be recovered by detergent and competition with hrg (Fig. 2B, lanes 5 and 
8). In addition, a trypsin preparation that was treated with phenylmethylsulfonyl fluoride 
prior to the addition to the digestion reaction did not abolish hrg binding by HER3- 
ECD 1 '™ (Fig. 2B, lane 9). Based on these results, we conclude that the proteolyzed 

25 HER3-ECD I ' IV does not bind hrg. 
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D. Heregulin Protects HER3-ECD MV from Proteolytic Cleavage 

The observation that proteolytic cleavage destroys the binding of HER3-ECD I_IV 
to hrg provide evidences that cleavage occurs at or near the binding site. This led us to 
investigate the proteolysis of HERS-ECD 1 ^ in the presence of hrg. The digestion of 
5 HER3-ECD IIV in the presence of a molar excess of hrg (Fig. 3A, lane 3) produces a 
different cleavage pattern than in the absence of hrg (Fig. 3A, lane 2). Complete 
protection of HER3-ECD IIV was observed (Fig. 3B, lane 3) at an approximate 1:1 molar 
ratio of hrg over HER3-ECD I ~ rv . Fragment 1 has a higher molecular weight in the 
presence of hrg (Fig. 3A, lane 3) than in its absence (Fig. 3A, lane 2), whereas the size of 
10 fragment 2 is apparently not affected by the presence of hrg (Fig. 3A, lanes 2-5). The 
cleavage that produces fragment 1 is apparently blocked by hrg but produces an 
additional, smaller fragment, which was not detected in the SDS gel of the digest of 
HE:R3-ECD I - IV . These data provide evidence that hrg protects HER3-ECD I - IV from the 
proteolytic cleavage that generates fragment 1. 

15 

E. Localization of the Cleavage Sites 

Taken together, these results provide evidence that there are at least two cleavage 
sites, one that is protected by hrg and another that is not To localize the positions of the 
cleavage sites, we used N-terrninal sequencing and mass spectrometry. Fragment 1 begins 

20 at position 51 in the absence of hrg and at position 20 (the first residue in native HER3- 
ECD*- IV ) in the presence of hrg (Fig. 4). The N-terrninal 31 -residue fragment generated 
from the cleavage following residue 50 was not detected by SDS-PAGE or by mass 
spectrometry. The corresponding size by mass spectrometry of fragment 1 is 26.7 kDa in 
the absence of hrg and 30 kDa in the presence (Fig. 4). The difference in mass of 

25 fragment 1 in the presence and absence of hrg is 3270 daltons, which corresponds to the 
predicted size of 3216 daltons, based on the amino acid sequence of the missing 
fragment (Fig. 4). This provide evidences that the amino acid residues between positions 
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20 and 50 of HER3-ECD irv are not glycosylated. Based on these results, we conclude 
that hrg protects HER3-ECD IIV from proteolysis at position 50. 

The cleavage that generates fragment 2 was not blocked by hrg in our assays and 
is located at position 270. The mass of fragment 2 is 47.0 kDa both in the presence and 
5 absence of hrg. The C terminus may contain additional cleavage sites that are not 
protected by hrg, because there is a missing C-terminal fragment (7.5 kDa) both in the 
presence and absence of hrg (Fig. 4). However, cleavage of the C terminus could be 
blocked by an antibody against the V5 epitope tag (Fig. lb). 

The cleavage sites on the carboxyl side of Tyr 50 and Phe 270 are more typical of 
10 chymotryptic rather than tryptic cleavage. This cleavage could be due to chymotrypsin 
present in the solution purchased from life Technologies, Inc. However, the identity of 
the specific protease involved does not affect the finding that hrg protects HER3-ECD 1 ' 
w from cleavage at Tyr 50 but not at Phe 270 . 

15 F. Protection Is Due to a Specific Interaction of hrg with HEItf-ECD 1 -™ 

To show that hrg protection is a result of the specific interaction between hrg 
and HER3-ECD I_IV , we repeated the digestion in the presence of a binding-deficient 
mutant of hrg (NAs-hrg) and EGF. NAs-hrg is deficient in binding due to the fact that 
the N-terminal residues that confer specific HEIO-ECD 1 '™ binding were mutated to 

20 alanine. NAs-hrg and EGF do not shift the position of HER3-ECD I - IV in a gel mobility 
shift assay (Fig. 5, lanes 3 and 4). The cleavage pattern obtained in the presence of NAs- 
hrg and EGF matches that of unprotected HER3-ECD WV (Fig. 3A, lanes 4 and 5 versus 
lane 2), further confirming that they do not bind hrg. Therefore, the difference in the 
cleavage pattern in the presence of hrg is due to a specific interaction between hrg and 

25 HER3-ECD IIV . 
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Example 3: Heregulin Binding Assays 

A. Gel Mobility Shift Assays 

HEBJ-ECD 1 '™ (1 .4 ul) in storage buffer was incubated with 9.6 ul of hrg (6.6 uM 
in PBS) for 30 min at room temperature. 4.0-ul aliquots of this mixture were run on a 10- 
5 15% Phast gel under native conditions. This assay was repeated for NAs-hrg, EGF, and 
ptoteolyzed HER3-ECD I " IV under the same conditions. 

B. "Pull-down" Assay Using Immobilized Trx-hrg 

The ability of proteolyzed HER3-ECD IIV to bind hrg was analyzed by a pull- 

10 down assay. S-protein agarose (0.5 ml) (Novagen) was spun down and resuspended three 
times in PBS. Aliquots of 100 ul were spun again, and trx-hrg (600 nM) and bovine 
serum albumin (BSA) (1 mg/ml) were added to the resin. After incubating 15 min at 4 
°C, the resin was washed five times with PBS, and blocked again with BSA. The 
following samples were added to each aliquot of resin: HER3-ECD I " IV , proteolyzed 

15 HER3-ECD I - IV , HER3-ECD I " IV protected by hrg during proteolysis, and HEIG-ECD 1 -™ 
incubated with trypsin deactivated with 1 mM phenylmethylsulfonyl fluoride prior to the 
proteolysis reaction. HER3-ECD I " rv was at 600 nM. BSA (1 mg/ml) was also added to 
each aliquot. Following a 15-min incubation at 4 °C, the resin was spun down and 
resuspended three times in PBS. Each aliquot of resin was then resuspended in PBS 

20 containing 1 uM hrg or PBS. After a 1-min incubation at 4 °C, the samples were spun 
down, and the supernatant was diluted with 2x SDS-PAGE sample buffer containing 1.0 
mM p-ME. Each aliquot of resin was then resuspended in SDS-PAGE sample buffer. 
The samples were analyzed by SDS-PAGE on a 4-15% polyacrylamide gradient gel (Bio- 
Rad) and visualized by Western blotting and chemttuminescence using a polyclonal 

25 antibody directed against the HER3-ECD (E38530; Transduction Laboratories, 
Lexington, KY) followed by incubation with a secondary antibody conjugated to 
horseradish peroxidase (Invitrogen). 
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The ability of HER3-ECD 1 " 11 to bind hrg was also evaluated in the pull-down 
assay in the same way, widi the following exceptions. HER3-ECD^ U (600 nM) was 
added to the trx-hrx and control resins, and the individual samples were resuspended in 
the various competing ligands indicated in Fig. 5. HER3-ECD 1 - 11 was visualized by 
5 Western blotting and chemiluminescence using a monoclonal antibody directed against 
the V-5 epitope conjugated to horseradish peroxidase (Invitrogen). 

Example 4: Identification of the Digested Fragments and Amino Acid Analysis 

Proteolytically digested HER3-ECD J - IV was diluted with sample buffer 

10 containing (3-ME and run on a 4-15% polyacrylamide gradient gel (Bio-Rad) as described 
above. The samples were then transferred to a polyvinylidene difluoride membrane, and 
the fragments were visualized by Ponceau stain. Edman sequencing of excised bands was 
performed in the UCLA Protein Microsequencing Facility using a Porton-Beckman Gas 
Phase Sequencer, and the first 10 residues of each fragment were identified. HER3- 

15 ECD I_IV was proteolyzed in the presence of an equimolar amount of an antibody against 
the C-terminal V5 epitope for 30, 60, and 120 min. The C-terminal fragment was 
identified by Western blotting using an antibody against the C-terminal V5 epitope 
(Invitrogen). 

For the amino acid analysis the molar concentrations of HER3-ECD I - IV , hrg, and 
20 NAs-hrg were determined by amino acid analysis at the UCLA Protein Microsequencing 
Facility. The samples were hydrolyzed in 6 N HC1 at 110 °C for 18 h under a vacuum in 
a nitrogen atmosphere. The hydrolysate was visualized with phenylisothiocyanate. The 
. phenylisothiocyanate amino acids were analyzed on a reverse phase column (Novapak) 
using a sodium acetate-acetonittile gradient (see, e.g. Bidlingmeyer et aL, (1984) J.. 
25 Chromatogr. 336, 93-104; Cohen, S. A., and Strydom, D. J. (1988) Anal. Biochem. 174, 
1-16). 
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Example 5: Matrix-assisted Laser Desorptioa Ionization (MALDR Mass Spect rometry 

We used MALDI time-of-flight mass spectrometry to measure the molecular 
mass of proteolyzed HER3-ECD IIV and HER3-ECD" 1 . The mass spectrometry was 
performed on a Voyager RP machine (PerCeptive Biosystems, Frarningham, 
5 Massachusetts) with BSA as an internal standard. 0.3 ul of HEr^-ECD 1 "™ (2.3 mg/ml) 
in 20 mM sodium acetate pH 5.5, 40 mM sodium chloride was mixed with 0.5 ul of 10 
mg/ml sinapinic acid in 70% acetonitrile, 0.1% trifluoracetate and dried on die MALDI 
plate. The digests (plus and minus hrg) of HEE^-ECD 1 '^ were prepared in the same 
way, except that the sample was taken direcdy from the digest reaction after 1 h of 
10 incubation time. The digests were run under oxidizing and reducing conditions. The 
digests were reduced by the addition of 1 mM dithiothreitol prior to mixing the sample 
with the matrix. The molecular mass of HER3-ECD I " n (12 uM) in PBS was measured 
under the same conditions. 

15 Example 6: Sequence Alignments and Surface Area Calculations 

An initial multiple sequence alignment was performed with the GCG program 
Pileup (Wisconsin Package, SeqLab, SeqWeb) using the Blossum 62 matrix (see, e.g. 
Henikoff, S., and Henikoff, J. G. (1992) Proc. Nad. Acad. ScL U. S. A. 89, 10915-10919). 
Empirical observations of the N terminus were used in conjunction with the initial 

20 alignment to produce a final alignment We used the GCG program Bestfit (Wisconsin 
Package, SeqLab, SeqWeb) (see, e.g. Smith, T., and Waterman, M. (1981) Adv. AppL 
Math. 2, 482-489) to find the segment of greatest similarity between two sequences. The 
amount of solvent-exposed surface area of domain LI that is buried by domain SI of 
IGF-1R was calculated using the GCG program Areaimol (see, e.g. Lee, B., and 

25 Richards, F. M. (1971) J. MoL Biol. 55, 379-400; Collaborative Computational Project. 
(1994) Acta Crystaliogr. Sect. D 50, 760-763). 
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Example 7: Cloning and Expression of HER3-ECD 1 ' 11 

The HER3 cDNA was amplified from the pJTH3 plasmid provided by Amgen, 
using the N-terminal primer (CTA GTC TCT AGA TCC GAG GTG GGC AAC TCT) 
(SEQ ID NO: 7) and C-terminal primer (TAC CGA TCT AGA TTT CGG ACA GAG 
5 ACC CCC) (SEQ ID NO: 8). Following amplification, HER3-ECD 1 ' 11 (residues 20-329) 
was cloned into the BgUl and Xbal site of the pMT/BiP/V5-His-A expression vector 
(Invitrogen, Carlsbad, CA). This vector contains an N-terminal Drosophila leader 
sequence and C-terminal His tag and V5 epitope tags. S2 cells were co-trans fected with 
the vector containing HER3-ECD 111 and the pCoHYGRO vector (Invitrogen), which 

10 provides hygromycin resistance. We obtained a stable cell line after 3 weeks of selection 
with 300 ug/ml hygromycin (Invitrogen). Five liters of S2 medium (Sigma) with 1% fetal 
bovine serum were inoculated with S2 cells and grown to a cell density of 5 x 106 in a 
spinner flask (Bellco) and then induced for 3 days at room temperature with 500 uM 
G1SO4. HER3-ECD W1 was secreted to the medium. We used an ammonium sulfate cut 

15 (80%, w/v) to precipitate the protein from the medium 1 . The pellet was resolubilized in 
20 mM Tris, pH 7.9, 1.5 M NaCl, and 0.1% Tween 20. The solution was dialyzed into 20 
mM Tris, pH 7.9, 0.5 M NaCl, and 5 mM imidazole and then purified on a 5-ml 
Amersham Pharmacia Biotech "HITRAP Ch^ting" column (Amersham Pharmacia 
Biotech), loaded with NiSQ4- FoEowing elution from the column, the protein was 

20 dialyzed into 20 mM sodium potassium phosphate, pH 10. The protein was loaded onto 
an anion exchange column (Bio-Rad), and pure HER3-ECD 1 " 11 was collected from the 
flow-through. 

Example 8: Multiangle Light Scattering of HER3-ECD 111 
25 Size exclusion chromatography was performed using a TosoHaas G3000SWXL 

column, followed by light scattering on a miniPAWN three-angle light-scattering 
instrument (Wyatt Technologies, Santa Barbara, CA). Data analysis was carried out using 
ASTRA software. HER3-ECD r i1 112 uM) in 20 mM Hepes, pH 7.5, was injected onto 
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the column at a flow tate of 0.4 ml/min. The refractive index increment (d/z/dV) for the 
protein portion of the HER3-ECD 111 in this buffer was assumed to be equal to that of 
monomeric BSA (Sigma) (dn/dc = 0.181 cm 3 /g). The carbohydrate dfi/dc contribution 
was estimated to be 0.157 cm 3 /g and was integrated into the final dn/dc (0.180 cm 3 /g) 
5 proportionally. Calculated extinction coefficients uncorrected for folding effect were 
used. The error to noise in the light scatter data is about ±3%. An error in the assumed 
extinction coefficient would result in a proportional error in the calculated molecular 
mass. 

10 Example 9: Ultracentrifugation of HER3-ECD 111 

Sedimentation equilibrium was performed at 4 °C in a Beckman Optima XL-A 
analytical ultracentrifuge using absorption optics at 280 nm. A 12-mm path length six- 
sector cell was used to measure protein samples at initial A28O values of 0.15, 0.35, and 
0.75. All samples were in PBS. Sedimentation equilibrium profiles were measured at 

15 12,000 and 15,000 rpm. The data were initially fitted with a nonlinear least-squares 
exponential fit for a single ideal species using Origin (version 3.01). Since no 
concentration or speed dependence of the molecular weight was apparent, the Beckman 
global analysis software (the "multifit" option of the above mentioned software) was 
used to analyze all six scans simultaneously. A partial specific volume of 0.712 calculated 

20 from the amino acid composition and corrected to 4 °C was used (see, e.g. Cohn, E. J., 
and Edsall, J. T. (1943) in Proteins, Amino Acids and Peptides as Ions and Dipolar Ions 
(Colin, E. J. , and Edsall, J. T., eds), pp. 370-381, Reinhold Publishing Corp., New York; 
Laue et al., (1992) in Analytical Ultracentrifugation in Biochemistry and Polymer Science 
(Harding, S. E. , Rowe, A. J. , and Horton, J. C, eds), pp. 90-125, The Royal Society of 

25 Chemistry, Cambridge, United Kingdom). 
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Example 10: Surface Plasmoa Resonance Measurement of Binding between HER3- 
ECD In and the EGF-like Domain of Heregulin 

Trx-hrg (13 uM) in MES buffer (100 mM pH 6.0) was immobilized on a BIAcore 

CMS chip using standard N-hydroxysucdriimide/l-ethyl-3-(3- 

5 dimediylaminopropyl)carbodiirrude amine-coupling chemistry. The surface of the chip 

was blocked with ethanolamine and could be regenerated with 5.0 M NaCl and washes 

with running buffer (PBS + surfactant). HER3-ECD 1 " 11 (filtered through a 0.1-um filter 

unit (Millipore, Bedford, MA)) in running buffer was applied to the chip at various 

concentrations (in triplicate) to determine the dissociation constant. Data were analyzed 

10 with the BIAevaluation software. Competition experiments were carried out using hrg at 

an equimolar concentration to HER3-ECD WI . 

Example 11: Recombinant HER3-ECD 1 " 11 Is Su fficient for Heregulin Binding 

To show that a ligand binding site is located within domain I of HER3-ECD I " IV , 
15 we recombinantly expressed domain I of HER3 alone (HER3-ECD 1 ) as well as domains 
I and II together (HER3-ECD 111 ). Expression of domain I alone resulted in an 
improperly folded protein because of incorrect disulfide formation, so only HER3- 
ECD HI could be evaluated for ligand binding. The recombinant form of HER3-ECD MI 
(residues 20-329) was expressed in Drosophila S2 cells and purified from the medium. The 
' 20 protein was found to be pure by SDS-PAGE analysis with an apparent molecular mass 
of 40 kDa (Fig. 6A). We used MALDI mass spectrometry to estimate the carbohydrate 
content The molecular mass of HER3-ECD 111 was found to be between 38.5 and 40 
kDa, compared with a theoretical molecular mass of 37.7 kDa (Fig. 6B). This provide 
evidences that HER3-ECD in has a carbohydrate content between 3.3 and 5.7% by 
. 25 weight 

Size exclusion chromatography followed by multiangle light scattering was used 
to show that recombinant HER3-ECD 111 is a monomer. The concentration of HER3- 
ECD 1 " 11 at the time of light scattering was 1.8 uM. At this concentration, a single peak 
was observed with a molecular mass of ~34 kDa (S.E. = ±4%) corresponding to 
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monomelic HER3-ECD MI (Fig. 6C). There was a slight shift in the peak upon the 
addition of a 1.3-fold molar excess of hrg. The shifted peak has an estimated molecular 
mass of 35 kDa (S.E. = ±3%) (Fig. 6C). To confirm that HER3-ECD 1 - 11 is a monomer, 
we used ultracentrifugation at three different concentrations and three different speeds. 
5 The estimated average molecular weight of HER3-ECD 1-11 , using a group analysis of all 
of the scans, is 35,000 ±210 daltons (Fig. 6D). This is consistent with the expected 
molecular mass (37.5 kDa) of monomelic HER3-ECD 1-11 . 

We used two independent mediods to demonstrate direct binding of hrg to 
HER3-ECD 1-11 : a pull-down assay and surface plasmon resonance (SPR; BIAcore) 

10 analysis. Binding of hrg to HER3-ECD 1-11 was shown using a pull-down assay under 
physiological salt conditions. In this analysis, we measured the amount of the V5-tagged 
HER3-ECD 111 that dissociated from immobilized trx-hrg by the addition of hrg. Using 
the trx-hrg-coupled S-protein-agarose, HER3-ECD 1 ' 11 could be eluted with 1 uM hrg 
(Fig. 7A, lane 4). In contrast, neither PBS (Fig. 7 A, lane 3) nor EGF (Fig. 7A, lane 5) was 

15 effective in eluting HER3-ECD 111 . 

To further confirm that HER3-ECD 1-11 binds hrg, we analyzed the interaction by 
SPR (BIAcore). Trx-hrg fusion protein was immobilized on a BIAcore chip for these 
measurements. SPR measurements using HER3-ECD MI showed binding to the 
immobilized trx-hrg with a calculated equilibrium dissociation constant of 68 nM, 

20 calculated directiy from kon and koff (kon = (1.07 ± 0.1) X 10 5 M" 1 s* 1 ; koff = (7.27 ± 1) X 
10" 3 s _1 ) (Fig. 7b). This interaction could be inhibited by stoichiometric concentrations of 
hrg. Based on these results, we conclude that HER3-ECD 111 containing domains I and II 
of HER3 binds hrg. 
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Example 12: Identification, Generation and Characterization of HER3 Residues 
Involved in Heregulin Binding 

A. Generation of trx-hrg and HER3 Point Mutants 

5 A thioredoxin-heregulin fusion (trx-hrg) was generated and purified as described 

elsewhere (see, e.g. Landgraf et aL (1998) Biochemistry 37(9), 3220-8). The HER3 point 
mutants were generated using the method of oligonucleotide directed mutagenesis 
described by Kunkel (see, e.g. Kunkel, T. A. (1988) Nucleic Acids Symp Ser (19), 43). Table 
1 contains the oligonucleotides used for the mutagenesis. The PCR fragments were 

10 subcloned into the Bgl 2 and EcoR-1 site of HER3 in (residues 20-329) and HER3 UV 
(residues 20-643) in the pMT/BiP/V5-His-A expression vector (Invitrogen, Carlsbad, 
CA) and confirmed by sequencing. The mutants were expressed and purified as 
described elsewhere (see, e.g. Singer et al. (2001) / Biol Chem 276(47), 44266-74). 

15 B. Quantitation of the HER3 Point Mutants 

The concentration of the mutants were determined by analyzing the samples by 
SDS-PAGE on a 4-15% gradient gel (Biorad) followed by Western Blotting and 
chemiluminescence using a monoclonal antibody directed against the V-5 epitope 
conjugated to horse radish peroxidase (Invitrogen). Each mutant was run in triplicate 

20 and compared to a known concentration of HER3 I_U (see, e.g. Singer et al. (2001) J Biol 
Chem 276(47), 44266-74). 

C. Binding Assays, Mass Spectrometry and CD Analysis: 

The gel shift assay, "pull down" assay, and surface plasmon resonance 
25 measurements were performed as described elsewhere (see, e.g. Singer et aL (2001) / Biol 
Chem 276(47), 44266-74). Matrix-assisted Laser Desorption Ionization (MALDI) time- 
of-flight mass spectrometry was performed as described elsewhere (see, e.g. Singer et al. 
(2001) / Biol Chem 276(47), 44266-74). The CD analysis was performed using a circular 
dichroism instrument from AVIV (Lakewood, NJ) with a 0.1 mM pathlength. The scan 
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was performed from 260nM to 195nM every 1.0 nM with 1.50 nM between scans and 
3.0 seconds to collect each point 

D. A mutation of residue 64 in HERJ to alanine abolishes hengulin binding 
5 HER3 I -"E64A a decreased affinity for heregulin binding. To locate the region 

of domain I that is critical for hrg binding, we expressed a series of point mutants in 
domain I of HER3 MI and assayed for hrg binding by two independent methods: a pull 
down assay and BIAcore analysis. To determine whether the mutants of HER3 Ul are 
capable of binding hrg, we used a "pull down" assay in which we measured the HER3 1 * 11 

10 that could be dissociated from immobilized thioredoxin-heregulin (trx-hrg) by the 
addition of hrg. Using trx-hrg coupled S-protein agarose, HER3 1 " 11 was eluted with hrg 
and detergent (Figure 10A, lanes 3 and 5). No HER3 111 m4A could be recovered by 
either elution witii hrg or detergent (Figure 10A, lanes 4 and 6) This provides evidence 
that HER3 WI ^ does not bind hrg. 

15 To further confirm that HER3 1 " n E64A does not bind hrg, we analyzed the 

interaction by surface plasmon resonance (SPR, BIAcore). Trx-hrg fusion protein was 
immobilized on a BIAcore chip for these measurements. We have previously shown 
that HER3 1 " 11 showed binding to immobilized trx-hrg with a calculated equilibrium 
dissociation constant of 68 nM using SPR measurements (see, e.g. Singer et al. (2001) / 

20 Biol Chem 276(47), 44266-74). HER3 WI bma had a Kd of 95 nM, HER3 1-11 ™ 0A had a 
Kd of 15 nM, (Table 2) HER3 1 - 11 E64A had no detectable binding above 500 nM to the 
immobilized hrg on the BIAcore chip. (Figure 10B). All other mutants had a Kd similar 
to wild type. These results provide evidence that residues 44, 64, and 110 are in the 
heregulin binding region of HER3, and that residue 64 is critical for hereg ulin binding. 

25 The residue at position 51 was also mutated to alanine because it could be 

protected by hrg from proteolytic cleavage, but was not direcdy in the binding footprint 
(Singer, 2001). HER3 HI ^ had the same Kd as HER3 HI (Table 2). This provides 
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evidence that residue 51 contributes no binding energy direcdy to HER3-heregulin 
interactions. 

To show that the decreased binding affinity of HER3 HI ^> 4A is not a result of 
proteolysis or mis folding, we performed mass spectrometry and CD analysis of HER3 MI 
5 E64A . The molecular mass of HER3 rir ™ A is between 38.4 kDa and 39.8 kDa which is 
comparable to the molecular mass of HER3 WI (38.5 and 40 kDa) (Figure 11A). This 
provides evidence that the polypeptide chain of HER3 1 " 11 E64A is intact. The CD spectra 
of HER3*- n E64A and HER3 1 " 11 were essentially identical between 210nM and 260nM 
providing evidence that diey have the same secondary structure. (Figure 11B). The 

10 fluctuations in the signal between 190nM and 210nM in HER3 WI ™ h are probably due 
to aggregation. These results show that a mutation at residue 64 in HER3 1 * 11 has litde 
affect on the structure of the protein, but results in decreased affinity for hrg. 

To show that mutation at residue 64 has an effect in the extracellular domain of 
. HER3, we expressed a series of mutants including HER3 MV E64A and HER3 I ~ rv and 

15 assayed for hrg binding by two independent methods: a native gel analysis, and a "pull 
down" assay. To determine the effects of cleavage of HER3 hTV on its ability to bind 
kg* a g e ^ mobility shift assay was performed. In a native gel analysis, HEFJi 1 "™ shows 
an electrophoretic mobility similar to HER3 HVE64A . HER3 I ' IV shows a discrete shift in 
the presence of a 1:1 molar ratio of hrg to HER3 I ' IV . HER3 wvawA does not shift its gel 

20 position in the presence of hrg. (Figure 12A). This finding provides evidence that 
HER3 WVE64A does not bind hrg. 

To confirm that HER3 I " IVE64A has decreased binding affinity for hrg, we used a 
"pull down" assay in which we measured the HER3 IIV and HER3 WVE64A that could be 
dissociated from immobilized thioredoxin-heregulin (trx-hrg) by the addition of hrg. 

25 Using trx-hrg coupled S-protein agarose, HER3 I1V was eluted with hrg and detergent 
(Figure 12B, lanes 6 and 8), but not with PBS (Figure 12B, lane 4). No HER3 I " IV 
could be recovered by elution with PBS, hrg or detergent (Figure 12B, lanes 5, 7 and 9). 
This provided evidence that HER3 I IV does not bind hrg. 
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E. Amino Acids 64 and 1 10 are critical for heregpkn binding 

In this study, we have identified two amino acids in discontinuous segments in 
domain I of HER3 which appear to be critical for heregulin binding. A mutation at 
5 residue 64 seemed to decrease all binding affinity in both recombinant H3 I_rv E64A and 
H3 1-11 E64A . No detectable binding could be measured by gel shift, Biacore, or pull down 
assay. H3 M * BMA seemed to be correcdy folded as determined by CD analysis. These 
experimental results provide evidence that the decrease in binding affinity observed with 
this mutation is probably due to direct effects on interaction between H3 1-11 E64A and 

10 heregulin rather than the consequences of misfolding or proteolysis of the mutant 
protein. A mutation at residue 110 (H3 MI V110A ) caused a 4 fold increase in binding 
affinity. A mutation at residue 44 (H3 WI E44A ) had a 1.5 fold decrease in binding affinity 
A mutation at the protected proteolysis site (H3 l ~ n K51A ) had no effect on binding 
affinity. These results provide evidence that amino acids 64 and 110 are critical for 

15 heregulin binding. 

F. Residues 64 and 110 map to the Hormone binding Footprint in the IGF-1R Structure. 

Residues 64 and 110 are in domain I of HER3 and map to residues H30 and 
L81 in. the hormone binding footprint in the IGF-1R (see, e.g. Garrett et aL (1998) Nature 
20 394(6691), 395-9). (Figure 13). They are within 12A of each other although they lie on 
different strands. They are within 22A of the protected proteolysis site (see, e.g. Singer et 
al. (2001) / Biol Chem 276(47), 44266-74). Their close proximity to each other and the 
proteolysis site provide evidence that both residues may be involved in ligand binding. 

25 G. Residues 64 and 1 10 could form specific interactions with heregulin 

The five N-terrninal residues of heregulin are required for specific interaction 
with HER3 (see, e.g. Jones et aL (1998) / Biol Chem 273(19), 11667-74). A valine lies at 
position 180 and a histidine is at position 178. H3 VI 10 and hrg VI 80 could form 
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hydrophobic interactions. H3 E64 and hrg H180 could form a salt bridge. (Figure 14). 
Mutation of these residues to alanine could disrupt these specific interactions. 

Residue 50 can be protected by hrg from proteolysis but still not be involved 
directly with hrg binding. Our results provide evidence that residue 50 is not direcdy 
5 involved in hrg binding because a mutation at this site had no effect on hrg binding 
affinity. We propose that the N-terminus of hrg binds to residues 64 and 110 and that 
the omega loop blocks the protected proteolysis site (residue 50) by sterically hindering 
interaction with trypsin (Figure 15). This figure also shows how hrg in this orientation 
could fit into the cleft fomied by domains I, II, and III as was proposed by (see, e.g. 

10 Garrett et al. (1998) Nature 394(6691), 395-9) and (see, e.g. Jorissen et al, (2000) Protein 
Science 9, 310-324) for die IGF-1R and IR. 

We have identified two residues in HER3 that appear to be involved in 
hrg binding utilizing results from an alanine scanning mutagenesis of the related insulin 
receptor (see, e.g. Williams et al. (1995) / Biol Chem 270(7), 3012-6) and information from 

15 the hormone binding footprint of the known structure of the Insulin growth Factor 
Receptor (see, e.g. Garrett et aL (1998) Nature 394(6691), 395-9). A multiple sequence 
alignment of these three receptors allowed us to identify residues which were conserved 
among all three and were in or near the hormone binding footprint of the IGF-1R (see, 
e.g. Singer et aL (2001) ] Biol Chem 276(47), 44266-74). This allowed us to choose which 

20 residues would have the highest probability of forming interactions with heregulin. We 
identified two residues that were critical for heregulin binding. We observed that 
complementary residues in heregulin that might form specific interactions with HER3. 
However, while this analysis implies direct molecular interactions between HER3 and 
heregulin, it is not direct proof. A high resolution structure of the HER3-heregulin 

25 complex is required to confirm these interactions. . 
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Tables 



Table 1: Oligonucleotide sequences used for generating the point mutants. 





N-terminal primer 


C-terminal primer 






HER3 I-II (residue 329) 


HER3 MI 


CTA CTC TCT AGA TCC GAG GGC AAC TCT 


TAC CGA TCT AGA TTT CGG 


(residue 20) 


(SEQ ID NO: 9) 


ACA GAG ACC CCC (SEQ ID 






NO: 10) 



HER3 
Mutants 


Plus Strand 


Minus Strand i 


E43A 


GCGATGCTGCGAACCAATACC 
(SEQ ID NO: 11) 


GGTATTGGTTCGCAGCATCGC 
(SEQ ED NO: 12) 1 


N44A 


CGATGCTGAGGCCCAATACCAG 
(SEQ ED NO: 13) 


CTGGTATTGGCGCTCAGCATCG 
(SEQ ED NO: 14) 


K51A 


GACACTGTACGCGCTCTACGAG 
(SEQ ID NO: 15) 


CTCGTAGAGCCGGTACAGTGTC 
(SEQ ID NO: 16) 


E64A 


GGGAACCTTGCGATTGTGCTC 
4 (SEQ ID NO: 17) 


GAGCACAATCGCAAGGTTCCC 
(SEQ ID NO: 18) 


V66A 


CTTGAGATTGCGCTCACGG 
(SEQ ID NO: 19) 


CGTGAGCGCAATCTCAAGGT 
(SEQ ED NO: 20) 


F94A 


CC ATG AATG AAGC CTCT ACTCT AC 
(SEQ ED NO: 21) 


GTAGAGTAGAGCGTTCATTCATGG 
(SEQ ID NO: 22) 


V110A 


GACCCAGGCCTACGATGGGAA 
(SEQ ID NO: 23) 


CTTCCCATCGTAGGCCTGGGTC 
(SEQ ID NO: 24) 



5 



Table 2: Dissociation constants for the binding of heregulin to alanine mutants of 
HER3 UI . Recombinant HER3 alanine mutants were bound to immobilized hrg on the 
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10 



15 



BLAcore chip. HER3 1-11 showed binding to immobilized trx-hrg with a calculated 
equilibrium dissociation constant of 68 nM (see, e ; g. Singer et ai (2001) / Biol Chem 
276(47), 44266-74). The mutants are designated by the residue being mutated followed 
by the number indicating the position in the HER3 sequence, followed by alanine. Thus 
V110A is a mutant in which valine at position 110 has been mutated to alanine. ND > 
SOOnM indicates that no detectable binding was observed above 500nM. These results 
indicate that a mutation of valine at position 110 increased binding affinity 5 fold, while a 
mutation of the glutamate at position 64 nearly abolished binding affinity. 



H3 I-II Mutant 


Kd(nM) 


E43A 


63.0 +/- 6.4 


N44A 


95.5 +/- 15.6 


K51A 


63.0 +/- 6.4 


E64A 


ND > 500 oM 


V66A 


60.5 +/- 6.3 | 


V110A 


15.4 +/- 3.1 



Table 3: Additional amino acid residues in HER3 associated with heregulin binding as 
determined by homology, crystallographic and solvent accessibility criteria. 



2) 



IGF-1R HER3 amino acid residues (SEQ ID NO: 



20 544 30-46 

27-29, 31-36 61-63,65-70 

52-61 88-96 

77-79,81-90 106-109,111-120 

107-115 143-151 

25 1 33-141 169-176 

Residues include those in HER3 that lay on the face of the HER3 domain that includes 
the mutated residues characterized herein. 
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Table 4: Illustrative HER3, Heregulin and HER2 polynucleotide and polypeptide 
sequences: 

HER 3 Polynucleotide Sequence 

5 

ATGAGGGCGAACGACGCTCTGCAGGTGCTGGGCTTGCTTTTCAGCCTGGCCCGGGGCTCCGAGGTGGGC 
AACTCTCAGGCAGTGTGTCCTGGGACTCTGAATGGCCTGAGTGTGACCGGCGATGCTGAGAACCAATAC 
CAGACACTGTACAAGCTCTACGAGAGGTGTGAGGTGGTGATGGGGAACCTTGAGATTGTGCTCACGGGA 
CACAATGCCGACCTCTCCTTCCTGCAGTGGATTCGAGAAGTGACAGGCTATGTCCTCGTGGCCATGAAT 

10 GAATTCTCTACTCTACCATTGCCCAACCTCCGCGTGGTGCGAGGGACCCAGGTCTACGATGGGAAGTTT 
GCCATCTTCGTCATGTTGAACTATAACACCAACTCCAGCCACGCTCTGCGCCAGCTCCGCTTGACTCAG 
CT C AC CGAGATTC TGTC AGGGGGTGTTTATATTG AG AAG AACGATAAG CTTTGTCACATGG AC ACAATT 
GACTGGAGGGACATCGTGAGGGACCGAGATGCTGAGATAGTGGTGAAGGACAATGGCAGAAGCTGTCCC 
CCCTGTCATGAGGTTTGCAAGGGGCGATGCTGGGGTCCTGGATCAGAAGACTGCCAGACATTGACCAAG 

15 ACCATCTGTGCTCCTCAGTGTAATGGTCACTGCTTTGGGCCCAACCCCAACCAGTGCTGCCATGATGAG 
TGTGCCGGGGGCTGCTCAGGCCCTCAGGACACAGACTGCTTTGCCTGCCTO 

GC CTGTGTAC CTCGC TGTCC AC AGC CTC TTGTCTAC AAC AAG CTAACTTTCC AGCTGGAAC C CAATCC C 
CACACCAAGTATCAGTATGGAGGAGTTTGTGTAGCCAGCTGTCCCCATAACTTTGTGGTGGATCAAACA 
TCCTGTGTCAGGGCCTGTCCTCCTGACAAGATGGAAGTAGATAAAAATGGGCTCAAGATGTGTGAGCCT 
20 TGTGGGGGACTATGTCCCAAAGCCTGTGAGGGAACAGGCTCTGGGAGCCGCTTCCAGACTGTGGACTCG 
AG C AACATTGATG GATTTGTGAACTGC AC C AAGAT CCTGGGC AAC CTGGACTTTCTGATCACCGGCCTC 
AATGGAGACCCCTGGCACAAGATCCCTGCCCTGGACCCAGAGAAGCTCAATGTCTTCCGGACAGTACGG 
GAGATCA(^GGTTACCTGAACATCGAGTCCTGGCCGCCCCACATGCAGAA 

TTGACAACCATTGGAGGCAGAAGCCTCTACAACCGGGGCTTCTCATTGTTGATCATGAAGAACTTGAAT 

25 GTC ACATCTCTGGGCTTCCGATC CCTGAAGGAAATTAGTGCTGGGCGTATCTATATAAGTGC CAATAG G 
CAGCTCTG CTAC C ACCACTC TTTG AACTGGACCAAGGTGCTTCGGGGGCCTACGGAAGAGCGACTAGAC 
ATCAAGCATAATCGGCCGCGCAGAGACTGCGTGGCAGAGGGCAAAGTGTGTGACCCACTGTGCTCCTCT 
GGGGGATGCTGGGGCCCAGGCCCTGGTCAGTGCTTGTCCTGTCGAAATTATAGCCGAGGAGGTGTCTGT 
GTGACCCACTGCAACTTTCTGAATGGGGAGCCTCGAGAATTTGCCCATGAGGCCGAATGCTTCTCCTGC 

30 CACCCGGAATGCCAACCCATGGGGGGCACTGCCACATGCAATGGCTCGGGCTCTGATACTTGTGCTCAA 
TGTGCCCATTTTCGAGATGGGCCCCACTGTGTGAGCAGCTGCCCCCATGGAGTCCT 
CCAATCTACAAGTACCCAGATGTTCAGAATGAATGTCGGCCCTGCCATGAGAACTGCACCCAGGGGTGT 
AAAGGACCAGAGCTTCAAGACTGTTTAGGACAAACACTGGTGCTGATCGGCAAAACCCATCTGACAATG 
GCnTTGACAGTGATAGCAGGATTGGTAGTGATTTTCATGATGCTGGGCGGCACTTTTCTCTACTGGCGT 

35 GG GCG CCGG ATTCAGAATAAAAG GGCTATG AGGCGATACTTGGAACGGGGTGAGAG CATAGAGC CTCTG 
GACCC(^GTGAGAAGGCTAA(^y^GTCTTGGCCAGAATCTTCAAAGAGAC^GAGCTAAGGAAGCTTAAA 
GTGCTTGGCTCGGGTGTCTTTGGAACTGTGCACAAAGGAGTGTGGATCCCTGAGGGTGAATCAATCAAG 
ATTCCAGTCTGCATTAAAGTCATTGAGGACAAGAGTGGACGGCAGAGTTTTCAAGCTGTGACAGATCAT 
ATGCTGGCCATTGGCAGCCTGGACCATGCCCACATTGTAAGGCTGCTGGGACTATGCCCAGGGTCATCT 

40 CTGCAGCTTGTCACTCAATATTTGCCTCTGGGTTCTCTGCTGGATCATGTGAGACAACACCGGGGGGCA 
CTGGGGCCACAGCTGCTGCTCAACTGGGGAGTACAAATTGCCAAGGGAATGTACTACCTTGAGGAACAT 
GGTATGGTGCATAGAAACCTGGCTGCCCGAAACGTGCT 

GATTTTGGTGTGGCTGACCTGCTGCCTCCTGATGATAAGCAGCTGCTATACAGTGAGGCCAAGACTCCA 
ATTAAGTGGATGGCCCTTGAGAGTATCCACTTTGGGAAATACACACACCAGAGTGATGTCTGGAGCTAT 
45 GG TGTGAC AGTTTG G G AGTTG ATGACCTTCGGGGCAGAG C C CT ATG C AGGG CT ACG ATTGG C TG AAG TA 
CCAGACCHX3CTAGAGAAGGGGGAGCGGTTGGGACAGCCCCAGATCTGCACAATTC 

ATGGTCAAGTGTTGGATGATTGATGAGAACATTCGCCCAACCTTTAAAGAACTAGCC7UVTGAGTTCACC 
AGGATGGCCCGAGACCCACCACGGTATCTGGTCATAAAGAGAGAGAGTGGGCCTGGAATAGCCCCTGGG 
CCAGAGCCCCATGGTCTGACAAACAAGAAGCTAGAGGAAGTAGAGCTGGAGCCAGAACTAGACCTAGAC, 
50 CTAGACTTGGAAGC^GAGGAGGACAACCnXSGCAACCACCACACTGGGCTCCGCCCTCAGCCT 

GGAACACTTAATCGGCCACGTGGGAGCCAGAGCCTTTTAAGTCCATCATCTGGATACATGCCCATGAAC 
CAGGGTAATCTTGGGGGGTCTTGCCAGGAGTCTGGAGTTTC 

GTCTCTCTACACCCAATGCCACGGGGATGCCTGGCATCAGAGTCATCAGAGGGGCATC 
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G AGGCTG AG CTC CAGG AGAAAGTGTC AATGTGTAGAAGC CGG AG C AGG AG CC GG AGCC C ACG GC C ACGC 
GGAGATAGCGCCTACCATT CC C AGCGCCACAGTCTGCTG ACTCCTGTTACCC CACTCTC CCCAC C CGGG 
TTAGAGGAAGAGGATGTCAACGGTTATGTCATGCCAGATACACACCTCAAAGGTACTCCCTCCTCCCGG 
GAAGGCACCCTTTCTTCAGTGGGTCTCAGTTCTGTCCTGGGTACTGAAGAAGAAGATGAAGATGAGGAG 
5 TATG AATACATGAACCGG AGGAGAAGGCACAGTC G AC CTC ATCC CC CTAGGCC AAGTTCC CTTG AGG AG 
CTGGGTTATGAGT AC ATGGATGTGGGGTCAG ACCTC AGTG CCTCTCTGGGC AG C AC AG AG AGTTG CC C A 
CTCCACCCTGT AC C C ATCATG C C CACTG CAGG CAC AACTC C AG ATG AAGACTATGAATAT ATGAATCGG 
CAACGAGATGGAGGTGGTC CTGGGGGTG ATTATGC AG CC ATG GG GGC CTGC C C AG C AT CTG AGC AAG GG 
TATGAAGAGATC^AGAGCTTTTGAGGGGCCTGGACATGAGGCCCCCCATGTCCATTATGCCCGCCTAAAA 
10 ACTCTACGTAGCTTAGAGGCTACAGACTCTGCCTTTGATAACCCTGATTACTGGCATAGCAGGCTTTTC 
CC C AAGG CTAATGCC C AG AGAACG SEQ ID NO: 1 



15 HER3 Polypeptide Sequence 

MRAITOALQVLGLLFSI^G SEVGNSQAVCPGTLNGLSVTGDAENQYQTLYKIjYERCEVVMGNLEIVLTG 
HNADL S FLQ W I RE VTG YVL VAMNE FS TL P L P NIjRWRGTQVYDGKF A I F VMLN YNTNS S HALRQLRLTQ 
LTEILSGGVYIEK1TOKLCHMDTIDWRDIVRDRDAEIVVKDNGRSCPPCHEVCKGRCWGPGSEDCQTLTK 
20 TICAPQCNGHCFGPNPNQCCHDECAGGCSGPQDTDCFACRHFNDSGACVPRCPQPLVYNKLTFQLEPNP 
HTKYQYGGVCVASCPHNFVVDQTSCVRACPPDKMEVDKNGL^ 

SNIDGFVNCTKILGmDFLITGLNGDPWHKIPALDPEKliNVFRTVREITGYLNIQSWPPHMHNFSVFS^ 
LTTIGGRSLYTOGFSLLIMKNLim'SLGFRSLKEISAGRIYISATOQLCYHHSLNWTKVLRGPTEERLD 
IKHNRPRRDCVAEGKVCDPLCSSGGCWGPGPGQCLSCROTSRGGVCVTHCOTLNGEPREFAHEAECFSC 
25 HPECQPMGGTATMGSGSDTCAQCAHFRI)GPHWSSCPHGVLGAKGPIYKYPDVQNECRPCHENCTQGC 
KGPELQDCLGQTLVLIGKTHIjTMALTVIAGLWIFMMLGGTFLYWRGRRIQNKRA^ 
DPSEKANICVLARI FKETELRKIjKVI^SGWGTVHKGVWI PEGES I KIPVCIKVIEDKSGRQS FQAVTDH 

mlaigsldhahiwllglcpgsslqlwqylplgslldhtoqhrg 

gntvhrniaarnvllkspsqvqvadfgvadllppddkqllys^^ 
30 gvtwelmtfgaepyaglrlaevpdllekgerlaqpqictidwmvw^ 

rmardpprylvikresgpgiapgpephgltnkkleevelepeldldldl 

gtlnrprgsqsllspssgympmnqgnlggscqesavsgssercprpvslhpmprgclasesseghvtgs 

eaelqekvsmcrsrsrsrsprprgdsayhsqrhslltpvtplsppgleeedvngyvmpdthlkgtpssr 

egtlssvglssvlgteeededeeyeymnrrrrhspphpprpssleelgyeymdvgsdiisaslgstqscp 
3 5 lhpvp imptagttpdedyeymnrqrdgggpggdyaamgacpaseqgyeemrafqgpghqaphvhyarlk 

tlrsleatdsafdnpdywhsrlfpkanaqrt seq id no: 2 



Heregulin Polynucleotide Sequence 

40 

ATGTCCGAGCGCAAAGAAGGCAGAGGCAAAGGGAAGGGC^GAAGAAGGAGTOAGGCTCCGGCAAGAAG 
CCGGAGTCCGCGGCGGGC AGC CAGAGC CCAGCCTTG CCTCCCCAATTGAAAGAGATGAAAAGCCAGGAA 
TCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTC 
TGGTTCAAGAATGGGAATGAATTGAATCGAAAAAACAAACCACAAAATATCAAGATACAAAAAAA^ 

45 GGGAAGTCAGAACTTCGGATTAACAAAGCTVrCACTGGCT 

AGCAAATTAGGAAATGACAGTGCCTCTGCCAATATCACCATCGTGGAATO^^ 
ATGCGAGCCTCAACTGAAGGAGCATATGTGTCTTCAGAGTCTCCCATTAGAATATC^ 
GGAGCAAATACTTCTTCATCTACATCTACATCCACCACTGGGACAAGCCATCTTG^ 
AAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGACCTTTCAAACCCC 

50 TACTTGTGCAAGTGCCCAAATGAGTTTACTGGTGATCX3CTGCCAAAACT 

AAGCATCTTGGGATTGAATTTATGGAGGCGGAGGAGCTGTACCAGAAGAGAGTGCTGACCATAACCGGC 
ATCTGCATCGCCCTCCTTGTGGTCGGCATCATGTGTGTGGTGGCCTACTC 
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AAAAAG CTGC ATG ACCGTCTTCGG C AGAGCCTTCGGTCTG AACG AAAGAATATGATG AAC ATTG CC AAT 
GGGCCTCACCATCCTAACCCACCCCCCGAGAATGTCCAGCTGGTGAATCAATACGTATCTAAAAACGTC 
ATCTC CAGTG AG C ATATTGTTGAG AG AG AAGCAGAGACATC C TTTT C C ACC AGTCACT AT ACTTC C AC A 
GCCC ATC ACT CCACTACTGTC AC C C AGACTCCT AG C CAC AGCTGG AG CAACGG AC AC ACTGAAAGC ATC 
5 CTTTCCGAAAGCCACTCTGTAATCGTGATGTCATC CGTAGAAAACAGTAGGC ACAGC AG CCCAACTGGG 
GGCCC AAG AGG ACGTCTT AATGG C ACAGG AGG CC CT CGTG AATGTAACAGCTTCCTC AGG C ATG CC AG A 
GAAAC C C CTG ATTC CT AC CG AG ACTCTC CTCATAGTGAAAGGTATGTGTCAGC C ATGAC CACCC CGGCT 
CGTATGTCACCTGTAGATTTCCACACGCCAAGCTCCCCCAAATCGCCCCCTTCGGAAATGTCTCCACCC 
GTGTCCAGCATGACGGTGTCCATGCCTTCCATGGCGGTCAGCCCCTTCATGGAAGAAGAGAGACCTCTA 

10 CTTCTCGTGACACCACCAAGGCTGCGGGAGAAGAAGTTTGACCATCACCCTCAGCAGTTCAGCTCCTTC 
CACCACAACCCCGCGCATGACAGTAACAGCCTCCCTGCTAGCCCCTTGAGGATAGTGGAGGATGAGGAG 
TATGAAACGACCCAAGAGTACGAGCCAGCCCAAGAGCCTGTTAAGAAACTCGCCAATAGCCGGCGGGCC 
AAAAGAACCAAGCCCAATGGCCACATTGCTAACAGATTGGAAGTGGACAGCAACACAAGCTCCCAGAGC 
AGTAACTCAGAGAGTG AAAC AGAAG ATG AAAGAGTAGGTGAAG AT ACGCCTTTC CTGGG C ATAC AG AAC 

15 CCCCTGGCAGCCAGTCTTGAGGCAACACCTGCCTTCCGCCTGraCTGACAGCAGGACTAACCCAGCAGGC 
CG CTTCTCGAC AC AGGAAG AAATC C AGGC C AGGCTGT CT AGTGTAATTGCTAACC AAGAC C CTATTGCT 
GTA SEQ ID NO: 3 



20 Heregulin Polypeptide Sequence 

MSERKEGRGKGKGKKKERGSGKKPESAAGSQSPALPPRLKEMKSQESAAGSKLVLRCETSSEYSSLRFK 
WFKNGNELNRKNKPQNIKIQKKPGKSELRINKASIiADSGEYMC 

MPASTEGAWSSESPIRISVSTEGANTSSSTSTSTTGTSHLVKCAEKEKTFCWGGECFMVKDLSNPSR 

25 YLCKCPNEFTGDRCQfcnfVMASFYKHLGIEFM^ 

KKLHDRLRQS LRS ERNtWMNI ANGPHHPNPPPENVQLVNQYVS KNVI S S EHI VEREAETS FSTSHYTST 
AHHSTTVTQTPSHSWSNGHTESILSESHSVIVMSSVENSRHSSPTGGPRGRLNGTGGPRECNSFLRHAR 
ETPDSYRDSPHSERYVSAMTTPARMSPVDFHTPSSPKSPPSEMSPPVSSMTVSMPSMAVSPFMEEERPL 
LLVTPPRLREKXFDHHPQQFSSFHHNPAHDSNSLPASPLRIVEDEEYETTQEra 

30 KRTKPNGHIANRLEVDSNTSSQSSNSESETEDERVGEDTPFLGIQNPLAASLEATPAFRLADSRTNPAG 
RFSTQEEIQARLSSVIANQDPIAV SEQ ID NO: 4 



HER2 Polynucleotide Sequence 

35 

ATGGAGCTGGCGGCCTTGTGCCGCTGGGGGCTCCTCCTCGCCCTCTTGCCCCCCGGAGCCGCGAGCACC 
CAAGTGTGCACCGGCACAGACATGAAGCTGCGGCTCCCTGCCAGTCCCGAGACCCACCTGGACATGCTC 
CGCCACCTCTACCAGGGCTGCCAGGTGGTGCAGGGAAACCTGGAACTCACCTACCTGCCCACCAATGCC 

40 AGCCTGTCCTTCCTGCAGGATATCCAGGAGGTGCAGGGCTACGTGCTCATCGCTCACAACCAAGTGAGG 
CAGGTCCCACTGC AGAGGCTGCGG ATTGTGCGAGG CACCC AG CTCTTTGAGGACAACTATG CC CTGGCC 
GTGCTAGAC AATGGAGACCCGCTG AACAATACCACCCCTGTCACAGGGGCCTCCCCAG GAGGC CTGCGG 
GAGCTGCAGCTTCGAAGCCTCACAGAGATC1TCAAAGGAGGGGTCTTGATCCAGCGG 
TGCTAC C AGGAC ACGATTTTGTGG AAGGACATCTTCC AC AAGAACAACC7VGCTGG CTCTC AC ACTGATA 

45 GA(^CCAACCGCTCTCGGGCCTGCC^CCCCTGTTCTCCGATGTGTAAGGGCTCCCGCTGCTGGGGAGAG 
AGTTCTGAGGATOGTCAGAGCCTGACGCGCACTGTCT^ 

CTGCCCACTGACTGCTGCCATGAGCAGTGTGCTGCCGGCTGCACGGGCCCCAAGCACT 
GCCTGCCTCCACTTCAACCACAGTGGCATCTGTGAGCTGCACTO 
GACACGTTTGAGTCCATGCCCAATCCCGAGGGCCGGTATACATTCGGCGCCAGCTG 
50 CCCTACAACTACCTTTCTACGGACGTGGGATCCTGCACCCTCGTCTC 

ACAGCAGAGGATGGAACACAGCGGTGTGAGAAGTGCAGCAAGCCCTGTGCCCGAGTGTGCTATGGTCTG 

GGGATGGAGCACTTGCGAGAGGTGAGGGCAGTTACCAGTGCCAATATC 

AAGATCTTTGGGAGCCTGGCATTTOTGCCGGAGAGCTTTGAT^ 
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CTC CAGCCAGAGCAG CTC C AAGTGTTTGAGACTCTGGAAGAGATC AC AGGTT ACCTATAC AT CTC AG CA 
TGG C CGG ACAGC CTG CCTG ACCTC AGCGTCTTC CAG AACCTG C AAGTAATCCGGGGACGAATTCTGCAC 
AATGGCGCCTACTCGCTGACCCTGCAAGGGCTGGGCATCAGCTGGCTGGGGCTGCGCTCACTGAGGGAA 
CTGGGCAGTGGACTGGCCCTCATCCACCATAACACCCACCTCTGCTTCGTGCACACGGTGCCCTGGGAC 
5 CAGCTCTTTCGGAACCCGCACCAAGCTCTGCTCCACACTGCCAACCGGCCAGAGGACGAGTGTGTGGGC 
GAGGGCCTGGCCTGCCACCAGCTGTGCGCCCGAGGGCACTGCTGGGGTCCAGGGCCCACCCAGTGTGTC 
AACTGCAGCCAGTTCCTTCGGGGCCAGGAGTGCGTGGAGGAATGCCGAGTACTGCAGGGGCTCCCCAGG 
G AGTATGTGAATGC CAGGCACTGTTTGCCGTGCCAC CCTGAGTGTCAG CCCC AG AATGG CTCAGTGAC C 
TGTTTTGGAC CGGAGGCTGACC AGTGTGTGGCCTGTGCCC ACTATAAG GACC CTCC CTTCTGCGTGG CC 

10 CG CTG CC C CAGCGGTGTG AAACCTGACCTCTCCT AC ATGCC C ATCTGGAAGTTTCCAG ATG AGGAGGG C 
GC ATGCC AGCCTTG CCCC ATCAACTGC AC CC ACT CCTGTGTGGAC CTG GATG AC AAGGGCTG CCCCG CC 
GAGCAGAGAGCCAGCCCTCTGACGTCCATCGTCTCTGCGGTGGTTGGCATTCTGCTGGTCGTGGTCTTG 
GGGGTGGT CTTTGGGATC CTC ATC AAGCG ACG GCAGCAGAAGATCCGG AAGTACACGATGCG G AGACTG 
CTGCAGGAAACGGAGCTGGTGGAGCCGCTGACACCTAGCGGAGCGATGCCCAACCAGGCGCAGATGCGG 

15 ATCCTGAAAGAGACGGAGCTGAGGAAGGTGAAGGTGCTTGGATCTGGCGCTTTTGGCACAGTCTACAAG 
GGCATCTGGATCCCTGATGGGGAGAATGTGAAAATTCCAGTGGCCATCAAAGTGTTGAGGGAAAACACA 
TCC C CCAAAGC C AACAAAGAAATCTTAGACGAAG CATACGTGATGGCTGGTGTGGGCTCCCCAT ATGTC 
TCCCGCCTTCTGGGCATCTGCCTGACATCCACGGTGCAGCTX3GTGACAC^GCTTATGCCCTATGGCTGC 
CTCTTAGACCATGTCCGGGAAAACCGCGGACGCCTGGGCTCCCAGGACCTGCTGAACTGGTGTATGCAG 

20 ATTGCCAAGGGGATGAGCTACCTGGAGGATGTGCGGCTCGTACACAGGGACTTGGCCGCTCGGAACGTG 
CTGGTCAAGAGTCCCAACCATGTCAAAATTACAGACTTCGGGCTGGCTCGGCTGCTGGACATTGACGAG 
ACAGAGTACCATGCAGATGGGGGCAAGGTGCCCATCAAGTGGATGGCGCTGGAGTCCATTCTCCGCCGG 
CGGTTC AC CCAC C AGAGTG ATGTGTG GAGTTATGGTGTG ACTGTGTGGGAGCTGATGACTTTTGGGGC C 
AAACCTTACGATGGGATCCCAGCCCGGGAGATCCCTGACCTGCTGGAAAAGGGGGAGCGGCTGCCCCAG 

25 C CC CCC ATCTGCAC C ATTG ATGTCTAC ATG ATC ATGGT C AAATGTTGG ATGATTG ACTCTGAATGTCGG 
CCAAGATTCCGGGAGTTGGTGTCTGAATTCTCCCGCATGGCCAGGGACCCCCAGCGCTTTGTGGTCATC 
CAGAATGAGGACTTGGGCCCAGCCAGTCCCTTGGACAGCACCTTCTACCGCTCACTGCTGGAGGACGAT 
GACATGGGGGACCTGGTGGATGCTGAGGAGTATCTGGTACCCCAGCAGGGCTTCTTCTGTCCAGACCCT 
GCCCCGGGCGCTGGGGGCATGGTCCACCACAGGCACCGCAGCTCATCTACCAGGAGTGGCGGTGGGGAC 

30 CTGACACT AGGGCTGGAGCC CTCTGAAGAGG AGGCC CCCAGGTCTCCACTGG C ACC CTC CGAAGGGG CT 
GGCTCCGATGTATTTGATGGTGACCTGGGAATGGGGGCAGCCAAGGGGCTGCAAAGCCTCCCCACACAT 
GACCCCAGCCCTCTACAGCGGTACAGTGAGGACCCCACAGTACCCCTGCCCTCTGAGACTGATGGCTAC 
GTTGCCCCCCTGACCTGCAGCCCCCAGCCTGAATATGTGAACCAGCCAGATGTTCGGCCCCAGCCCCCT 
TCGCCCCGAGAGGGCCCTCTGCCTGCTGCCCGACCTGCTGGTGC(^CTCTGGAAAGGGCCAAGACTCTC 

35 TCCCCAGGGAAGAATGGGGTCGTCAAAGACGTTTTTGCCTTTGGGGGTGCCGTGGAGAACCCCGAGTAC 
TTGACACCCC^GGGAGGAGCTGCCCCTCAGCCCCACCCTCCTCCTGCCTTCAGCCC^GCCTTCGACAAC 
CTCTATTACTGGGACCAGGACCCACCAGAGCGGGGGGCTCOVCCCAGCACCTTCAAAGGGACACCTACG 
GCAGAGAACCCAGAGTACCTGGGTCTGGACGTGCCAGTG SEQ ID NO: 5 

40. 

HER 2 Polypeptide Sequence 

MELAALCRWGLLLALLPPGAASTQVCTGTDMKl^PASPEl^^ 

SLSFLQDIQEVQGYVLIAHNQWQWLQRLRIVRGTQLFEDNY^ 
45 ELQLRSLTE ILKGGVL IQRNPQLC YQDTI LWKD I FHKNNQLALTL IDTNRSRACHPCS PMCKGS RCWGE 

S S EDCQ S LTRT VC AGGCAR CKG PL PTDCCHEQ CAAGCTG PKHS DCL ACLHFNHS G I C ELHCP AL VT YNT 

DTFESMPNPEGRYTFGASCVTACPYNYLSTDVGSCTLVCPLHNQEVTAEDGTQRCEKCSKPCARVCYGL 

GMEHLREVRAVTSANIQEFAGCKKIFGSLAFLPESFDGDPASNTAPLQPEQLQVFETLEEITGYLYISA 

WPDSLPDLSVFQNLQVIRGRILHNGAYSLTLQGLGISWLGLRSIiRELGSGLALIHHNT 
50 QLFRNPHQALLHTANRPEDECVGEGI^CHQLCARGHCWGPGPTQCW 

EYWARHCLPCHPE(^PQNGSVTCFGPEADQCVAC7VHYKDPPF^^ 

ACQPCPINCTHSCVDLDDKGCPAEQRASPLTSIVSAWG 

LQETELVEPLTPSGAMPNQAQ^ILKETELRKVKVLGSGAFGTWK^^ 

SPKANKEILDEAYVMAGVGSPYVSRLLGICLTSWQLWQM^ 



81 



WO 03/011897 



PCT/US02/23963 



lAKGMSYLEDVRLVHRDLAARNVLVKS^ 
RFTOQSDWSYGVTWELOTFGAKPYDGIPAREIP 

PRFRELVSEFSRMARDPQRFWIQNEDLGPASPLDSTFYRSLLEDDDMGDLVDAEEYLVPQQGFFCPDP 
APGAGGIWHHRHRSSSTRSGGGDLTLGLEPSEEEAPRSPLAPSEGAGSDVFDGDLGMGAAKGLQSLPTH 
5 DPSPLQRYSEDPTVPLPSETDGYVAPLTCSPQPEYVNQPDVRPQPPSPREGPLPAARPAGATLERAKTL 
SPGKNGWKDVFAFGGAVENPEYLTPQGGAAPQPHPPPAFSPAFDNLYYWDQDPPERGAPPSTFKGTPT 
AENP E YLGItD VP V SEQ ID NO: 6 
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WHAT IS CLAIMED IS: 

1. A non-naturally occurring HER3 variant polypeptide consisting 
essentially of amino acid 20 to amino acid 329 of HER3 polypeptide sequence of SEQ 

5 ID NO: 2; wherein the HER3 variant polypeptide: 

(a) specifically binds to the heregulin polypeptide of SEQ ID NO: 4; and 

(b) exhibits an impaired ability to interact with the HER2 polypeptide of SEQ ID 

NO: 6. 

2. A non-naturally occurring HER3 variant polypeptide consisting 
10 essentially of amino acid 19 to amino acid 329 of HER3 polypeptide sequence of SEQ 

ID NO: 2; wherein the HER3 variant polypeptide has the ability to inhibit the interaction 
between wild type HER3 having the polypeptide sequence of SEQ ED NO: 2 and 
heregulin having the polypeptide sequence of SEQ ID NO: 4. 

3 The HER3 variant polypeptide of claim 1 further comprising one or 
15 more amino acid substitutions at the following residue position(s) in SEQ ID NO: 2: 
E43; N44; K51; E64; V66; and VI 10. 

4. A HER3 variant polypeptide comprising an amino acid sequence which 
differs from the native sequence HER3 polypeptide sequence of SEQ ID NO: 2 and has 
one or more of amino acid substitutions at the following residue position(s) in SEQ ID 

20 NO: 2: E43; N44; K51; E64; V66; and V110. 

5. An isolated nucleic acid comprising a nucleotide sequence encoding the 
HER3 variant of claim 4. 

6. A vector comprising the nucleic acid of claim 5. 

7. A host cell comprising the vector of claim 6. 

25 8. The host cell of claim 7 wherein said host cell is E. coU, 

9. A method of making HER3 variant polypeptide, comprising the steps of: 
providing a host cell comprising the vector of claim 6; (b) providing culture media; (c) 
. oilmring the host cell in the culture media under conditions sufficient to express the 
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HER3 variant polypeptide; (d) recovering the HER3 variant polypeptide from the host 
cell or culture media; and (e) purifying the HER3 variant polypeptide. 

10. The HER3 variant polypeptide of claim 4, wherein the HER3 variant 
polypeptide is conjugated or linked to one or more polyol groups. 
5 11. The HER3 variant polypeptide of claim 4, wherein the HER3 variant 

polypeptide has a substitution at VI 1 0. 

12. The HER3 variant polypeptide of claim 11, wherein the HER3 variant 
polypeptide has an alanine substitution at VI 10. 

13. The HER3 variant polypeptide of claim 4, wherein the HER3 variant 
10 polypeptide has a substitution at E64. 

14. The HER3 variant polypeptide of claim 4, wherein the HER3 variant 
polypeptide comprises a soluble, extracellular domain HER3 polypeptide. 

15. The isolated HER3 variant polypeptide of claim 1, wherein the HER3 
variant polypeptide does not have any portion of the amino acid sequence consisting of 

15 amino acid 330 to amino acid 1342 of HER3 polypeptide sequence of SEQ ID NO: 2. 

16. A method of identifying a compound which specifically binds the 
heregulin binding domain in a HER3 variant polypeptide comprising amino acid 20 to 
amino acid 329 of HER3 polypeptide sequence of SEQ ID NO: 2 comprising contacting 
the HER3 variant polypeptide with a test compound under conditions favorable to 

20 binding and determining whether the test compound specifically binds to the HER3 ■ 
variant polypeptide such that a compound which binds to the HER3 variant polypeptide 
can be identified 

17. The method of Claim 16, further comprising determining whether the 
test compound inhibits the heregulin induced tyrosine kinase activity associated with a 

25 HER3 polypeptide having the sequence shown in SEQ ID NO: 2 comprising the steps 
of: 

contacting mammalian cells that express HER3 receptor with heregulin in the 
presence and absence of the test compound; and 
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monitoting the mammalian cells for the tyrosine kinase activity associated with 
the HER3 polypeptide, wherein an inhibition in tyrosine kinase activity in the presence 
of the test compound as compared to the absence of the test compound identifies the 
test compound as an inhibitor of heregulin induced tyrosine kinase activity. 
5 18. The method of Claim 16, further comprising determining whether the 

test compound enhances the heregulin induced tyrosine kinase activity associated with a 
HER3 polypeptide having the sequence shown in SEQ ID NO: 2 comprising the steps 
of: 

contacting mammalian cells that express HER3 receptor with heregulin in the 

10 presence and absence of the test compound; and 

monitoring the mammalian cells for the tyrosine kinase activity of the HER3 
polypeptide, wherein an increase in tyrosine kinase activity in the presence of the test 
compound as compared to the absence of the test compound identifies the test 
compound as an enhancer of heregulin induced tyrosine kinase activity. 

15 19. A method of determining whether a test compound modulates the 

interaction between heregulin polypeptide of SEQ ID NO: 4 and a HER3 variant 
polypeptide comprising amino acid 20 to amino acid 329 of HER3 polypeptide sequence 
of SEQ ID NO:2 comprising the steps of: 

(a) contacting the HER3 variant polypeptide with a test compound under 
20 conditions favorable to binding; 

(b) contacting the HER3 variant polypeptide of step (a) with heregulin under 
conditions favorable to binding; 

(c) comparing the binding interaction between the HER3 variant polypeptide and 
heregulin of step (b) with the binding interaction between HER3 variant polypeptide and 

25 heregulin in the absence of the test compound such that a compound which modulates 
the interaction between heregulin and the HER3 variant polypeptide can be identified 

20. A method of inhibiting the interaction between a heregulin polypeptide 
having the sequence shown in SEQ ID NO: 4 and HER3 polypeptide having the 
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sequence shown in SEQ ID NO: 2 comprising exposing the heregulin polypeptide to a 
non-naturally occurring HER3 variant polypeptide comprising amino acid 20 to amino 
acid 329 of HER3 polypeptide sequence of SEQ ID NO: 2; wherein the HER3 variant 
polypeptide specifically binds to the heregulin polypeptide of SEQ 3D NO: 4; and 
5 exhibits an impaired ability to interact with the HER2 polypeptide of SEQ ID NO: 6. 

21. The method of claim 20, wherein the HER3 variant polypeptide has one 
or more of amino acid substitutions at the following residue position(s) in SEQ ID NO: 
2:E64andV110. 

22. The method of claim 20, wherein the HER3 variant polypeptide has an 
1 0 amino acid substitution at VI 1 0. 

23. The method of claim 20, wherein the amino acid substitution is V91A. 

24. The method of claim 20, wherein the HER3 variant polypeptide is 
conjugated or linked to one or more polyol groups. 

25. The method of claim 24, wherein the polyol group is polyethylene glycol. 

15 
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SEQUENCE LISTING 

<110> The Regents of the University of California et al . 

<120> METHODS AND MATERIALS FOR CHARACTERIZING 

AND MODULATING INTERACTIONS BETWEEN HEREGULIN AND HER3 



<130> 30448103WOU1 

<150> 60/308,431 
<151> 2001-07-27 

<160> 24 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 4026 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . ... (4026) 

<400> 1 

at 9 a 99 9 C 9 aac gac get ctg cag gtg etg ggc ttg ctt ttc age ctg 48 

Met Arg Ala Asn Asp Ala Leu Gin Val Leu Gly Leu Leu Phe Ser Leu 

15 10 15 

gec egg ggc tec gag gtg ggc aac tct cag gca gtg tgt cct ggg act 96 
Ala Arg Gly Ser Glu Val Gly Asn Ser Gin Ala Val Cys Pro Gly Thr 
20 25 30 

ctg aat ggc ctg agt gtg acc ggc gat get gag aac caa tac cag aca 144 
Leu Asn Gly Leu Ser Val Thr Gly Asp Ala Glu Asn Gin Tyr Gin Thr 
35 40 45 

ctg tac aag etc tac gag agg tgt gag gtg gtg atg ggg aac ctt "gag 192 
Leu Tyr Lys Leu Tyr Glu Arg Cys Glu Val Val Met Gly Asn Leu Glu 
50 55 60 

att gtg etc acg gga cac aat gec gac etc tec ttc ctg cag tgg att 240 
He Val Leu Thr Gly His Asn Ala Asp Leu Ser Phe Leu Gin Trp He 
65 70 75 80 

cga gaa gtg aca ggc tat gtc etc gtg gec atg aat gaa ttc tct act 288 
Arg Glu Val Thr Gly Tyr Val Leu Val Ala Met Asn Glu Phe Ser Thr 
85 90 95 

eta cca ttg ccc aac etc cgc gtg gtg cga ggg acc cag gtc tac gat 336 
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Leu Pro Leu Pro Asn Leu Arg Val Val Arg Gly Thr Gin Val Tyr Asp 

100 105 110 

ggg aag ttt gcc ate ttc gtc atg ttg aac tat aac acc aac tec age 3 84 

Gly Lys Phe Ala He Phe Val Met Leu Asn Tyr Asn Thr Asn Ser Ser 

115 120 125 

cac get ctg cgc cag etc cgc ttg act cag etc acc gag att ctg tea 432 

His Ala Leu Arg Gin Leu Arg Leu Thr Gin Leu Thr Glu He Leu Ser 

130 * 135 140 

999 99 fc 9tt ta t att gag aag aac gat aag ctt tgt cac atg gac aca 480 

Gly Gly Val Tyr He Glu Lys Asn Asp Lys Leu Cys His Met Asp Thr 

145 150 155 160 

att gac tgg agg gac ate gtg agg gac cga gat get gag at a gtg gtg 528 

He Asp Trp Arg Asp He Val Arg Asp Arg Asp Ala Glu He Val Val 

165 170 175 

aag gac aat ggc aga age tgt ccc ccc tgt cat gag gtt tgc aag ggg 576 

Lys Asp Asn Gly Arg Ser Cys Pro Pro Cys His Glu Val Cys Lys Gly 

180 185 190 

cga tgc tgg ggt cct gga tea gaa gac tgc cag aca ttg acc aag acc 624 

Arg Cys Trp Gly Pro Gly Ser Glu Asp Cys Gin Thr Leu Thr Lys Thr 

195 200 205 

ate tgt get cct cag tgt aat ggt cac tgc. ttt ggg ccc aac ccc aac 672 

He Cys Ala Pro Gin Cys Asn Gly His Cys Phe Gly Pro Asn Pro Asn 

210 215 220 

cag tgc tgc cat gat gag tgt gcc ggg ggc tgc tea ggc cct cag gac 720 

Gin Cys Cys His Asp Glu Cys Ala Gly Gly Cys Ser Gly Pro Gin Asp 

225 230 235 240 

aca gac tgc ttt gcc tgc egg cac ttc aat gac agt gga gcc tgt gta 768 

Thr Asp Cys Phe Ala Cys Arg His Phe Asn Asp Ser Gly Ala Cys Val 

245 250 255 

cct cgc tgt cca cag cct ctt gtc tac aac aag eta act ttc cag ctg 816 
Pro Arg Cys Pro Gin Pro Leu Val Tyr Asn Lys Leu Thr Phe Gin Leu 

260 265 270 

gaa ccc aat ccc cac acc aag tat cag tat gga gga gtt tgt gta gcc 864 

Glu Pro Asn Pro His Thr Lys Tyr Gin Tyr Gly Gly Val Cys Val Ala 

275 280 285 

age tgt ccc cat aac ttt gtg 9tg gat caa aca tec tgt gtc agg gcc 912 

Ser Cys Pro His Asn Phe Val Val Asp Gin Thr Ser Cys Val Arg Ala 

290 295 300 

tgt cct cct gac aag atg gaa gta gat aaa aat ggg etc aag atg tgt 960 

Cys Pro Pro Asp Lys Met Glu. Val Asp Lys Asn Gly Leu Lys Met Cys 
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305 310 315 320 

gag cct tgt ggg gga eta tgt ccc aaa gec tgt gag gga aca ggc tct 
1008 

Glu Pro Cys Gly Gly Leu Cys Pro Lys Ala Cys Glu Gly Thr Gly Ser 
325 330 335 

999 age cgc ttc cag act gtg gac teg age aac att gat gga ttt gtg 
1056 

Gly Ser Arg Phe Gin Thr Val Asp Ser Ser Asn lie Asp Gly Phe Val 
340 345 350 

aac tgc ace aag ate ctg ggc aac ctg gac ttt ctg ate acc ggc etc 
1104 

Asn Cys. Thr Lys lie Leu Gly Asn Leu Asp Phe Leu lie Thr Gly Leu 
355 360 365 

aat gga gac ccc tgg cac aag ate cct gec ctg gac cca gag aag etc 
1152 

Asn Gly Asp Pro .Trp His Lys lie Pro Ala Leu Asp Pro Glu Lys Leu 

370 375 ' 380 

aat gtc ttc egg aca gta egg gag ate aca ggt tac ctg aac ate cag 
1200 

Asn Val Phe Arg Thr Val Arg Glu lie Thr Gly Tyr Leu Asn lie Gin 

385 390 395 400 

tec tgg ccg ccc cac atg cac aac ttc agt gtt ttt tec aat ttg aca 
1248 

Ser Trp Pro Pro His Met His Asn Phe Ser Val Phe Ser Asn Leu Thr 
405 410 415 

acc att gga ggc aga age etc tac aac egg ggc ttc tea ttg ttg ate 
1296 

Thr lie Gly Gly Arg Ser Leu Tyr Asn Arg Gly Phe Ser Leu Leu lie 
420 425 430 

atg aag aac ttg aat gtc aca tct . ctg ggc ttc cga tec ctg aag gaa 
1344 

Met Lys Asn Leu Asn Val Thr Ser Leu Gly Phe Arg Ser Leu Lys Glu 
435 440 445 

att agt get ggg cgt ate tat ata agt gee aat agg cag etc tgc tac 
1392 

lie Ser Ala Gly Arg lie Tyr lie Ser Ala Asn Arg Gin Leu Cys Tyr 
450 455 460 

cac cac tct ttg aac tgg acc aag gtg ctt egg ggg cct acg gaa gag 
1440 

His His Ser Leu Asn Trp Thr Lys Val Leu Arg Gly Pro Thr Glu Glu 
465 470 475 480 
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cga eta gac ate aag cat aat egg ccg cgc aga gac tgc gtg gca gag 
1488 

Arg Leu Asp He Lys His Asn Arg Pro Arg Arg Asp Cys Val Ala Glu 
485 490 1 495 

ggc aaa gtg tgt gac cca ctg tgc tec tct ggg gga tgc tgg ggc cca 
1536 

Gly Lys Val Cys Asp Pro Leu Cys Ser Ser Gly Gly Cys Trp Gly Pro 
500 505 510 

ggc cct ggt cag tgc ttg tec tgt cga aat tat age cga gga ggt gtc 
1584 

Gly Pro Gly Gin Cys Leu Ser Cys Arg Asn Tyr Ser Arg Gly Gly Val 
515 ^520 525 

fc 9t gtg ace cac tgc aac ttt ctg aat ggg gag cct cga gaa ttt gec 
1632 

Cys Val Thr His Cys Asn Phe Leu Asn Gly Glu Pro Arg Glu Phe Ala 
530 535 540 

cat gag gec gaa tgc ttc tec tgc cac ccg gaa tgc caa ccc atg ggg 
1680 

His Glu Ala Glu Cys Phe Ser Cys His Pro Glu Cys Gin Pro Met Gly 

545 550 555 560 

ggc act gee aca tgc aat ggc teg ggc tct gat act tgt get caa tgt 
1728 

Gly Thr Ala Thr Cys Asn Gly Ser Gly Ser Asp Thr Cys Ala Gin Cys 
565 570 575 

gee cat ttt cga gat ggg ccc cac tgt gtg age age tgc ccc cat gga 
1776 

Ala His Phe Arg Asp Gly Pro His Cys Val Ser Ser Cys Pro His Gly 

580 585 590 

gtc eta ggt gee aag ggc cca ate tac aag tac cca. gat gtt cag aat 
1824 

Val . Leu Gly Ala Lys Gly Pro He Tyr Lys Tyr Pro Asp Val Gin Asn 
595. 600 605 

gaa tgt egg ccc tgc cat gag aac tgc ace cag ggg tgt aaa gga cca 
1872 

Glu Cys Arg Pro Cys His Glu Asn Cys Thr Gin Gly Cys Lys Gly Pro 

610 615 620 

gag ctt caa gac tgt tta gga caa aca ctg gtg ctg ate ggc aaa ace 
1920 

Glu Leu Gin Asp Cys Leu Gly Gin Thr Leu Val Leu He Gly Lys . Thr 

625 630 635 640 

cat ctg aca atg get ttg aca gtg ata gca gga ttg gta gtg att ttc. 
1968 

His Leu Thr Met Ala Leu Thr Val He Ala Gly Leu Val Val He Phe 
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atg atg ctg ggc 
2016 

Met Met Leu Gly 
660 

aat aaa agg get 
2064 

Asn Lys Arg Ala 
675 

cct ctg gac ccc 
2112 

Pro Leu Asp Pro 
690 

aaa gag aca gag 
2160 

Lys Glu Thr Glu 
705 

gga act gtg cac 
2208 

Gly Thr Val His 



att cca gtc tgc 
2256 

lie Pro Val Cys 
740 

ttt caa get gtg 
2304 

Phe Gin Ala Val 
755 

gec cac att gta 
2352 

Ala His He Val 
770 

ctt gtc act caa 
2400 

Leu Val Thr Gin 
785 . 

caa cac egg ggg 
2448 

Gin His Arg Gly 



645 

ggc act ttt etc 
Gly Thr Phe Leu 

atg agg cga tac 

Met Arg Arg Tyr 
680 

agt gag aag get 

Ser Glu Lys Ala 
695 

eta agg aag ctt 

Leu Arg Lys Leu 
710 

aaa gga gtg tgg 

Lys Gly Val Trp 
725 

att aaa gtc att 
He Lys Val He 

aca gat cat atg 

Thr Asp His Met 
760 

agg ctg ctg gga 

Arg Leu Leu Gly 
775 

tat ttg cct ctg 

Tyr Leu Pro Leu 
790 

gca ctg ggg cca 

Ala Leu Gly Pro 
805 



650 

tac tgg cgt ggg 

Tyr Trp Arg Gly 
665 

ttg gaa egg ggt 
Leu Glu Arg Gly 

aac aaa gtc ttg 

Asn Lys Val Leu 
700 

aaa gtg ctt ggc 

Lys Val Leu Gly 
715 

ate cct gag ggt 

He Pro Glu Gly 
730 

gag gac aag agt 

Glu Asp Lys Ser 
745 

ctg gee att ggc 
Leu Ala He Gly 

eta tgc cca ggg 

Leu Cys Pro Gly 
780 

ggt tct ctg ctg 

Gly Ser Leu Leu 
795 

cag ctg ctg etc 

Gin Leu Leu Leu 
810 



655 

cgc egg att cag 

Arg Arg He Gin 
670 

gag age ata gag 

Glu Ser He Glu 
685 

gee aga ate ttc 
Ala Arg He Phe 

teg. ggt gtc ttt 

Ser Gly Val Phe 
720 

gaa tea ate aag 

Glu Ser He Lys 
735 

gga egg cag agt 

Gly Arg Gin Ser 
750 

age ctg gac cat 

Ser Leu Asp His 
765 

tea tct ctg cag 
Ser Ser Leu Gin 

gat cat gtg aga 

Asp His Val Arg 
800 

aac tgg gga gta 

Asn Trp Gly Val 
815 
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caa att gcc aag 
2496 

Gin lie Ala Lys 
820 

aga aac ctg get 
2544 

Arg Asn Leu Ala 
835 

ca 9 gtg gca gat 
2592 

Gin Val Ala Asp 
850 

cag ctg eta tac 
2640 

Gin Leu Leu Tyr 
865 

gag agt ate cac 
2688 

Glu Ser He His 



tat ggt gtg aca 
2736 

Tyr Gly Val Thr 
900 

gca ggg eta cga 
2784 

Ala Gly Leu Arg 
915 

egg ttg gca cag 
2832 

Arg Leu Ala Gin 
93 0 

gtc aag tgt tgg 
2880 

Val Lys Cys Trp 
945 

eta gcc aat gag 
2928 

Leu Ala Asn Glu 



gtc ata aag aga 
2976. 

Val He Lys Arg 



gga atg tac tac 
Gly Met Tyr Tyr 

gcc cga aac gtg 

Ala Arg Asn Val 
840 

ttt ggt gtg get 

Phe Gly Val Ala 
855 

agt gag gcc aag 

Ser Glu Ala Lys 
870 

ttt ggg aaa tac 

Phe Gly Lys Tyr 
885 

gtt tgg gag ttg 
Val Trp Glu Leu 

ttg get gaa gta 

Leu Ala Glu Val 
920 

ccc cag ate tgc 

Pro Gin He Cys 
935 

atg att gat gag 

Met He Asp Glu 
950 

ttc acc agg atg 

Phe Thr Arg Met 
965 

gag agt ggg cct 
Glu Ser Gly Pro 



ctt gag gaa cat 

Leu Glu Glu His 
825 

eta etc aag tea 
Leu Leu Lys Ser 

gac ctg ctg cct 

Asp Leu Leu Pro 
860 

act cca att aag 

Thr Pro He Lys 
875 

aca cac cag agt 

Thr His Gin Ser 
890 

atg acc ttc ggg 

Met Thr Phe Gly 
905 

cca gac ctg eta 
Pro Asp Leu Leu 

aca att gat gtc 

Thr He Asp Val 
940 

aac att cgc cca 

Asn lie Arg Pro 
955 

gcc cga gac cca 

Ala Arg Asp Pro 

.970 

gga ata gcc cct 

Gly He Ala Pro 



ggt atg gtg cat 

Gly Met Val His 
830 

ccc agt cag gtt 

Pro Ser Gin Val 
845 

cct gat gat aag 
Pro Asp Asp Lys 

tgg atg gcc ctt 

Trp Met Ala Leu 
880 

gat gtc tgg age 

Asp Val Trp Ser 
895 

gca gag ccc tat 

Ala Glu Pro Tyr 
910 

gag aag ggg gag 

Glu Lys Gly Glu 
925 

tac atg gtg atg 
Tyr Met Val Met 

acc ttt aaa gaa 

Thr Phe Lys Glu 
960 

cca egg tat ctg 

Pro Arg Tyr Leu 
975 

ggg cca gag ccc 
Gly Pro Glu Pro 
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980 985 990 

cat ggt ctg aca aac aag aag eta gag gaa gta gag ctg gag cca gaa 
3024 . 

His Gly Leu Thr Asn Lys Lys Leu Glu Glu Val Glu Leu Glu Pro Glu 
995 1000 1005 

eta gac eta gac eta gac ttg gaa gca gag gag .gac aac ctg gca acc 
3072 

Leu Asp Leu Asp Leu Asp Leu Glu Ala Glu Glu Asp Asn Leu Ala Thr 
1010 1015 1020 

t 

acc aca ctg ggc tec gee etc age eta cca gtt gga aca ctt aat egg 
3120 

Thr Thr Leu Gly Ser Ala Leu Ser Leu Pro Val Gly Thr Leu Asn Arg 
1025 1030 1035 1040 

cca cgt ggg age cag age ctt tta agt cca tea tct gga tac atg ccc 
3168 

Pro Arg Gly Ser Gin Ser Leu Leu Ser Pro Ser Ser Gly Tyr Met Pro 
1045 « 1050 - 1055 

atg aac cag ggt aat ctt 'ggg ggg tct tgc cag gag tct gca gtt tct 
3216 

Met Asn Gin Gly Asn Leu Gly Gly Ser Cys Gin Glu Ser Ala Val Ser 
1060 1065 1070 

999 a 9 c a 9t gaa egg tgc ccc cgt cca gtc tct eta cac cca atg cca 
3264 

Gly Ser Ser Glu Arg Cys Pro Arg Pro Val Ser Leu His Pro Met Pro 
1075 1080 1085 

c 99 99 a tgc ctg gca tea gag tea tea gag ggg cat gta aca ggc tct 
3312 

Arg Gly Cys Leu Ala Ser Glu Ser Ser Glu Gly His Val Thr Gly Ser 
1090 1095 1100 

gag get gag etc cag gag aaa gtg tea atg tgt aga age egg age agg 
3360 

Glu Ala Glu Leu Gin Glu Lys Val Ser Met Cys Arg Ser Arg Ser Arg 
1105 1110 1115 1120 

age egg age cca egg cca cgc gga gat age gec tac cat tec cag cgc 
3408 

Ser Arg Ser Pro Arg Pro Arg Gly Asp Ser Ala Tyr His Ser Gin Arg 

1125 1130 1135 

cac agt ctg ctg act cct gtt acc cca etc tec cca ccc ggg tta gag 
3456 

His Ser Leu Leu Thr Pro Val Thr Pro Leu Ser Pro Pro Gly Leu Glu 

1140 - 1145 1150 
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gaa gag gat gtc aac ggt tat gtc atg cca gat aca cac etc aaa ggt 
3504 

Glu Glu Asp Val Asn Gly Tyr Val Met Pro Asp Thr His Leu Lys Gly 
1155 1160 1165 

act ccc tec tec egg gaa ggc ace ctt tct tea gtg ggt etc agt tct 
3552 

Thr Pro Ser Ser Arg Glu Gly Thr Leu Ser Ser Val Gly Leu Ser Ser 
1170 1175 . 1180 

gtc ctg ggt act gaa gaa gaa gat gaa gat gag gag tat gaa tac atg 
3600 

Val Leu Gly Thr Glu Glu Glu Asp Glu Asp Glu Glu Tyr Glu Tyr Met 
1185 1190 1195 1200 

aac egg agg aga agg cac agt cca cct cat ccc cct agg cca agt tec 
3648 

Asn Arg Arg Arg Arg His Ser Pro Pro His Pro Pro Arg Pro Ser Ser 
1205 1210 1215. 

ctt gag gag ctg ggt tat gag tac atg gat gtg ggg tea gac etc agt 
3696 

Leu Glu Glu Leu Gly Tyr Glu Tyr Met Asp Val Gly Ser Asp Leu Ser 
1220 1225 1230 

gec tct ctg ggc age aca cag agt tgc cca etc cac cct gta ccc ate 
3744 

Ala Ser Leu Gly Ser Thr Gin Ser Cys Pro Leu His Pro Val Pro lie 
1235 1240 1245 

atg ccc act gca ggc aca act cca gat gaa gac tat gaa tat atg aat 
3792 

Met Pro Thr Ala Gly Thr Thr Pro Asp Glu Asp Tyr Glu Tyr Met Asn 
1250 1255 1260 

egg caa cga gat gga ggt ggt cct ggg ggt gat tat gca gec atg ggg 
3840 

Arg Gin Arg Asp Gly Gly Gly Pro Gly Gly Asp Tyr Ala Ala Met Gly 
1265 1270 1275 1280 

gec tgc cca gca tct gag caa ggg tat gaa gag atg aga get ttt cag 
3888 

Ala Cys Pro Ala Ser Glu Gin Gly Tyr Glu Glu Met Arg Ala Phe Gin 
1285 1290 1295 

ggg cct gga cat cag gec ccc cat gtc cat tat gec cgc eta aaa act 
3936 

Gly Pro Gly His Gin Ala Pro His Val His Tyr Ala Arg Leu Lys Thr 
1300 1305 1310 

eta cgt age tta gag get aca gac tct gec ttt gat aac cct gat tac 
3984 

Leu Arg Ser Leu Glu Ala Thr Asp Ser Ala Phe Asp Asn Pro Asp Tyr 
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1315 



1320 



1325 



tgg cat age agg ctt ttc 
4026 

Trp His Ser Arg Leu Phe 



ccc aag get 



aat gec cag aga acg 



1330 



Pro Lys Ala 
1335 



As n Ala Gin Arg Thr 
1340 



<210> 2 

<211> 1342 

<212> PRT 

<213> Homo sapiens 



<400> 2 


















Met 


Arg 


Ala 


Asn 


Asp Ala 


Leu 


Gin Val 


Leu Gly Leu Leu Phe 


Ser 


Leu 


1 








5 ' 






10 


15 




Ala 


Arg 


Gly 


Ser 


Glu Val 


Gly 


Asn Ser 


Gin Ala Val Cys Pro 


Gly 


Thr 








20 






25 


30 






Leu 


Asn 


Gly 


Leu 


Ser Val 


Thr 


Gly Asp 


Ala Glu Asn Gin Tyr 


Gin 


Thr 






35 








40 


45 






Leu 


Tyr 


Lys 


Leu 


Tyr Glu 


Arg 


Cys Glu 


Val Val Met Gly Asn 


Leu 


Glu 




50 








55 




60 






lie 


Val 


Leu 


Thr 


Gly His 


Asn 


Ala Asp 


Leu Ser Phe Leu Gin 


Trp 


He 


65 








70 






75 




80 


Arg 


Glu 


Val 


Thr 


Gly Tyr 


Val 


Leu Val 


Ala Met Asn Glu Phe 


Ser 


Thr 








85 






90 


95 




Leu 


Pro 


Leu 


Pro 


Asn Leu 


Arg 


Val Val 


Arg Gly Thr Gin Val 


Tyr 


Asp 








100 






105 


110 






Gly 


Lys 


Phe 


Ala 


He Phe 


Val 


Met Leu 


Asn Tyr Asn Thr Asn 


Ser 


Ser 


115 








120 


125 






His 


Ala 


Leu 


Arg 


Gin Leu 


Arg 


Leu Thr 


Gin Leu Thr Glu He 


Leu 


Ser 




130 








135 




140 






Gly 


Gly 


Val 


Tyr 


He Glu 


Lys 


Asn Asp 


Lys Leu Cys His Met 


Asp 


Thr 


145 








150 






155 




160 


lie 


Asp 


Trp 


Arg 


Asp He 


Val 


Arg Asp 


Arg Asp Ala Glu He 


Val 


Val 










165 






170 


175 




Lys 


Asp 


Asn 


Gly 


Arg Ser 


Cys 


Pro Pro 


Cys His Glu Val Cys 


Lys 


Gly 








180 






185 


190 






Arg 


Cys 


Trp 


Gly 


Pro Gly 


Ser 


Glu Asp 


Cys Gin Thr Leu Thr 


Lys 


Thr 






195 








200 


205 






lie 


Cys 


Ala 


Pro 


Gin Cys 


Asn 


Gly His 


Cys Phe Gly Pro Asn 


Pro 


Asn 




210 








215 




220 






Gin 


Cys 


Cys 


His 


Asp Glu 


Cys 


Ala Gly 


Gly Cys Ser Gly Pro 


Gin 


Asp 


225 








230 






235 




240 


Thr 


Asp 


Cys 


Phe 


Ala Cys 


Arg 


His Phe 


Asn Asp Ser Gly Ala 


Cys 


Val 










245 






250 


255 




Pro 


Arg 


Cys 


Pro 


Gin Pro 


Leu 


Val Tyr 


Asn Lys Leu Thr Phe 


Gin 


Leu 








260 






265 


270 






Glu 


Pro 


Asn 


Pro 


His Thr 


Lys 


Tyr Gin 


Tyr Gly Gly Val Cys 


Val 


Ala 






275 








280 


285 






Ser 


Cys 


Pro 


His 


Asn Phe 


Val 


Val Asp 


Gin Thr Ser Cys Val 


Arg 


Ala 




290 








295 




300 
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Cys 


Pro 


Pro Asp 


Lys 


Met 


Glu 


Val 


Asp 


Lys 


Asn 


Gly 


Leu 


Lys 


Met 


Cys 


305 








310 










315 










320 


Glu 


Pro 


Cys Gly 


Gly 


Leu 


Cys 


Pro 


Lys 


Ala 


Cys 


Glu 


Gly 


Thr 


Gly 


Ser 








325 










330 










335 




Gly 


Ser 


Arg Phe 


Gin 


Thr 


Val 


Asp 


Ser 


Ser 


Asn 


He 


Asp 


Gly 


Phe 


Val 






340 










345 










350 






Asn 


Cys 


Thr Lys 


He 


Leu 


Gly 


Asn 


Leu 


Asp 


Phe 


Leu 


He 


Thr 


Gly 


Leu 






355 








360 










365 








Asn 


Gly 


Asp Pro 


Trp 


His 


Lys 


He 


Pro 


Ala 


Leu 


Asp 


Pro 


Glu 


Lys 


Leu 




370 








375 










380 










Asn 


Val 


Phe Arg 


Thr 


Val 


Arg 


Glu 


He 


Thr 


Gly 


Tyr 


Leu 


Asn 


lie 


Gin 


385 








390 










395 










400 


Ser 


Trp 


Pro Pro 


His 


Met 


His 


Asn 


Phe 


Ser 


Val 


Phe 


Ser 


Asn 


Leu 


Thr' 








405 










410 










415 




Thr 


He 


Gly Gly 


Arg 


Ser 


Leu 


Tyr 


Asn 


Arg 


Gly 


Phe 


Ser 


Leu 


Leu 


He 






,420 










425 










430 






Met 


Lys 


Asn Leu 


Asn 


Val 


Thr 


Ser 


Leu 


Gly 


Phe 


Arg 


Ser 


Leu 


Lys 


Glu 






435 








440 










445 








He 


Ser 


Ala Gly 


Arg 


He 


Tyr 


He 


Ser 


Ala 


Asn 


Arg 


Gin 


Leu 


Cys 


Tyr 




450 








455 










460 










His 


His 


Ser Leu 


Asn 


Trp 


Thr 


Lys 


Val 


Leu 


Arg 


Gly 


Pro 


Thr 


Glu 


Glu 


465 








470 










475 










480 


Arg 


Leu 


Asp He 


Lys 


His 


Asn 


Arg 


Pro 


Arg 


Arg 


Asp 


Cys 


Val 


Ala 


Glu 








485 










490 










495 




Gly 


Lys 


Val Cys 


Asp 


Pro 


Leu 


Cys 


Ser 


Ser 


Gly 


Gly 


Cys 


Trp 


Gly 


Pro 






500 










505 










510 






Giy 


Pro 


Gly Gin 


Cys 


Leu 


Ser 


Cys 


Arg 


Asn 


Tyr 


Ser 


Arg 


Gly 


Gly 


Val 






515 








520 










525 








Cys 


Val 


Thr His 


Cys 


Asn 


Phe 


Leu 


Asn 


Gly 


Glu 


Pro 


Arg 


Glu 


Phe 


Ala 




530 








535 










540 










His 


Glu 


Ala Glu 


Cys 


Phe 


Ser 


Cys 


His 


Pro 


Glu 


Cys 


Gin 


Pro 


Met 


Gly 


545 








550 










555 










560 


Gly 


Thr 


Ala Thr 


Cys 


Asn 


Gly 


Ser 


Gly 


Ser 


Asp 


Thr 


Cys 


Ala 


Gin 


Cys 








565 










570 










575 




Ala 


His 


Phe Arg 


Asp 


Gly 


Pro 


His 


Cys 


Val 


Ser 


Ser 


Cys 


Pro 


His 


Gly 






580 










585 










590 






Val 


Leu 


Gly Ala 


Lys 


Gly 


Pro 


He 


Tyr 


Lys 


Tyr 


Pro 


Asp 


Val 


Gin 


Asn 






595 








600 










605 








Glu 


Cys 


Arg Pro 


Cys 


His 


Glu 


Asn 


Cys 


Thr 


Gin 


Gly 


Cys 


Lys 


Gly 


Pro 




610 








615 










620 










Glu 


Leu 


Gin Asp 


Cys 


Leu 


Gly 


Gin 


Thr 


Leu 


Val 


Leu 


He 


Gly 


Lys 


Thr 


625 








630 










635 










640 


His 


Leu 


Thr Met 


Ala 


Leu 


Thr 


Val 


He 


Ala 


Gly 


Leu 


Val 


Val 


He 


Phe 








645 










650 










655 




Met 


Met 


Leu Gly 


Gly 


Thr 


Phe 


Leu 


Tyr 


Trp 


Arg 


Gly 


Arg 


Arg 


He 


Gin 






660 










665 










670 






,Asn 


Lys 


Arg Ala 


Met 


Arg 


Arg 


Tyr 


Leu 


Glu 


Arg 


Gly 


Glu 


Ser 


He 


Glu 






675 








680 










685 








Pro 


Leu 


Asp Pro 


Ser 


Glu 


Lys 


Ala 


Asn 


Lys 


Val 


Leu 


Ala 


Arg 


He 


Phe 




690 








695 










700 










Lys 


Glu 


Thr Glu 


Leu 


Arg 


Lys 


Leu 


Lys 


Val 


Leu 


Gly 


Ser 


Gly 


Val 


Phe 


705 








710 










715 










720 


Gly 


Thr 


Val His 


Lys 


Gly 


Val 


Trp 


He 


Pro 


Glu 


Gly 


Glu 


Ser 


He 


Lys 
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725 










730 










735 




lie 


Pro 


Val Cys He 


Lys 


Val 


He 


Glu 


Asp 


Lys 


Ser 


Gly 


Arg 


Gin 


Ser 






740 








745 










750 






Phe 


Gin 


Ala Val Thr 


Asp 


His 


Met 


Leu 


Ala 


He 


Gly 


Ser 


Leu 


Asp 


His 






755 






760 










765 








Ala 


His 


He Val Arg 


Leu 


Leu 


Gly 


Leu 


Cys 


Pro 


Gly 


Ser 


Ser 


Leu 


Gin 




770 






775 










780 










Leu 


Val 


Thr Gin Tyr 


Leu 


Pro 


Leu 


Gly 


Ser 


Leu 


Leu 


Asp 


His 


Val 


Arg 


785 






790 










795 










800 


Gin 


His 


Arg Gly Ala 


Leu 


Gly 


Pro 


Gin 


Leu 


Leu 


Leu 


Asn 


Trp 


Gly 


Val 






805 










810 










815 




Gin 


He 


Ala Lys Gly 


Met 


Tyr 


Tyr 


Leu 


Glu 


Glu 


His 


Gly 


Met 


Val 


His 






820 








825 










830 






Arg 


Asn 


Leu Ala Ala 


Arg 


Asn 


Val 


Leu 


Leu 


Lys 


Ser 


Pro 


Ser 


Gin 


Val 






835 






840 










845 








Gin 


Val 


Ala Asp Phe 


Gly 


Val 


Ala 


Asp 


Leu 


Leu 


Pro 


Pro 


Asp 


Asp 


Lys 




850 






855 










860 










Gin 


Leu 


Leu Tyr Ser 


Glu 


Ala 


Lys 


Thr 


Pro 


He 


Lys 


Trp 


Met 


Ala 


Leu 


865 






870 










875 










880 


Glu 


Ser 


He His Phe 


Gly 


Lys 


Tyr 


Thr 


His 


Gin 


Ser 


Asp 


Val 


Trp 


Ser 






885 










890 










895 




Tyr 


Gly 


Val Thr Val 


Trp 


Glu 


Leu 


Met 


Thr 


Phe 


Gly 


Ala 


Glu 


Pro 


Tyr 






900 








905 










910 






Ala 


Gly 


Leu Arg Leu 


Ala 


Glu 


Val 


Pro 


Asp 


Leu 


Leu 


Glu 


Lys 


Gly 


Glu 






915 






920 










925 








Arg 


Leu 


Ala Gin Pro 


Gin 


He 


Cys 


Thr 


He 


Asp 


Val 


Tyr 


Met 


Val 


Met 


930 






935 










940 










Val 


Lys 


Cys Trp Met 


He 


Asp 


Glu 


Asn 


He 


Arg 


Pro 


Thr 


Phe 


Lys 


Glu 


945 






950 










955 










960 


Leu 


Ala 


Asn Glu Phe 


Thr 


Arg 


Met 


Ala 


Arg 


Asp 


Pro 


Pro 


Arg 


Tyr 


Leu 






965 










970 










975 




Val 


He 


Lys Arg Glu 


Ser 


Gly 


Pro 


Gly 


He 


Ala 


Pro 


Gly 


Pro 


Glu 


Pro 






980 








985 










990 






His 


Gly 


Leu Thr Asn 


Lys 


Lys 


Leu 


Glu 


Glu 


Val 


Glu 


Leu 


Glu 


Pro 


Glu 






995 






1000 








1005 






Leu 


Asp 


Leu Asp Leu 


Asp 


Leu 


Glu Ala 


Glu 


Glu 


Asp 


Asn 


Leu 


Ala 


Thr 




1010 




1015 








1020 








Thr 


Thr 


Leu Gly Ser 


Ala 


Leu 


Ser 


Leu 


Pro 


Val 


Gly Thr Leu 


Asn 


Arg 


1025 




1030 








1035 








1041 


Pro Arg Gly Ser Gin 


Ser 


Leu 


Leu 


Ser 


Pro 


Ser 


Ser Gly Tyr 


Met 


Pro 






1045 








1050 








1055 


Met 


Asn 


Gin Gly Asn Leu Gly Gly Ser 


Cys 


Gin 


Glu 


Ser 


Ala 


Val 


Ser 



1060 1065 . 1070 



Gly Ser Ser Glu Arg Cys Pro Arg Pro Val Ser I*eu His Pro Met Pro 

1075 1080 1085 

Arg Gly Cys Leu Ala Ser Glu Ser Ser Glu Gly His Val Thr Gly Ser 

1090 1095 1100 

Glu Ala Glu Leu Gin Glu Lys Val Ser Met Cys Arg Ser Arg Ser Arg 
1105 1110 1115 1120 

Ser Arg Ser Pro Arg Pro Arg Gly Asp Ser Ala Tyr His Ser Gin Arg 

1125 1130 1135 

His Ser Leu Leu Thr Pro Val Thr Pro Leu Ser Pro Pro Gly Leu Glu 
1140 1145 1150 
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Glu Glu Asp Val Asn Gly Tyr Val Met Pro Asp Thr His Leu Lys Gly 

1155 1160 1165 

Thr Pro Ser Ser Arg Glu Gly Thr Leu Ser Ser Val Gly Leu Ser Ser 

1170 1175 1180 

Val Leu Gly Thr Glu Glu Glu Asp Glu Asp Glu Glu Tyr Glu Tyr Met 
1185 1190 1195 1200 

Asn Arg Arg Arg Arg His Ser Pro Pro His Pro Pro Arg Pro Ser Ser 

1205 1210 1215 

Leu Glu Glu Leu Gly Tyr Glu Tyr Met Asp Val Gly Ser Asp Leu Ser 

1220 1225 1230 

Ala Ser Leu Gly Ser Thr Gin Ser Cys Pro Leu His Pro Val Pro lie 

. 1235 1240 1245 

Met Pro Thr Ala Gly Thr Thr Pro Asp Glu Asp Tyr Glu Tyr Met Asn 

1250 1255 1260 

Arg Gin Arg Asp Gly Gly Gly Pro Gly Gly Asp Tyr Ala Ala Met Gly 
1265 1270 1275 1280 

Ala Cys Pro Ala Ser Glu Gin Gly Tyr Glu Glu Met Arg Ala Phe Gin 

1285 1290 1295 

Gly Pro Gly His Gin Ala Pro His Val His Tyr Ala Arg Leu Lys Thr 

1300 1305 1310 

Leu Arg Ser Leu Glu Ala Thr Asp Ser Ala Phe Asp Asn Pro Asp Tyr 

1315 1320 1325 

Trp His Ser Arg Leu Phe Pro Lys Ala Asn Ala Gin Arg Thr 
1330 1335 1340 



<210> 3 
<211> 1935 
<212> DNA 

<213> Homo sapiens . 

<220> 

<221> CDS 

<222> (1) . . . (1935) 



<400> 3 

atg tec gag cgc aaa gaa ggc aga ggc aaa ggg aag ggc aag aag aag 4 8 

Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys 
1 5 . 10 15 

gag cga ggc tec ggc aag aag ccg gag tec gcg gcg ggc age cag age 96 

Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin Ser 
20 25 30 

cca gec ttg cct ccc caa ttg aaa gag atg aaa age cag gaa teg get 144 

Pro Ala Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser Ala 
35 40 45 

gca ggt tec aaa eta gtc ctt egg tgt gaa acc agt tct gaa tac tec 192 

Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser 
50 55 60 



tct etc aga ttc aag tgg ttc aag aat ggg aat gaa ttg aat cga aaa 



240 
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Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys 
65 70 75 80 

aac aaa cca caa aat ate aag ata caa aaa aag cca ggg aag tea gaa 288 
Asn Lys Pro Gin Asn lie Lys lie Gin Lys Lys Pro Gly Lys Ser Glu 
85 '90 * 95 

ctt cgc att aac aaa gca tea ctg get gat tct gga gag tat atg tgc 336 
Leu Arg lie Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys 
100 105 110 

aaa gtg ate age aaa tta gga aat gac agt gee tct gee aat ate ace 384 
Lys Val He Ser Lys Leu. Gly Asn Asp Ser Ala Ser Ala Asn lie Thr 
115 120 125 

ate gtg gaa tea aac gag ate ate act ggt atg cca gee tea act. gaa 432 
He Val Glu Ser Asn Glu He He Thr Gly Met Pro Ala Ser Thr Glu 
130 135 140 

gga gca tat gtg tct tea gag tct ccc att aga ata tea gta tec aca 480 
Gly Ala Tyr Val Ser Ser Glu Ser Pro He Arg He Ser Val Ser Thr 
145 150 155 160 

gaa gga gca aat act tct tea tct aca tct aca tec ace act ggg aca 528 
Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr 
165 170 175 

age cat ctt gta aaa tgt gcg gag aag gag aaa act ttc tgt gtg aat 576 
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
180 185 190 

99 a 999 gag tgc ttc atg gtg aaa gac ctt tea aac ccc teg aga tac 624 
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
195 200 205 

ttg tgc aag tgc cca aat gag ttt act ggt gat cgc tgc caa aac tac 672 
Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
210 215 220 

gta atg gee age ttc tac aag cat ctt ggg att gaa ttt atg gag gcg 720 
Val Met Ala Ser Phe Tyr Lys His Leu Gly He Glu Phe Met Glu Ala 
225 230 235 240 

gag gag ctg tac cag aag aga gtg ctg ace ata acc ggc ate tgc ate 768 
Glu Glu Leu Tyr Gin Lys Arg Val Leu Thr He Thr Gly He Cys He 
245 250 255 

gee etc ctt gtg gtc ggc ate atg tgt gtg gtg gee tac tgc aaa acc 816 
Ala Leu Leu Val Val Gly He Met Cys Val Val Ala Tyr Cys Lys Thr 
260 265 270 

aag aaa cag egg aaa aag ctg cat gac cgt ctt egg cag age ctt egg 864 
Lys Lys Gin Arg Lys Lys Leu His Asp Arg Leu Arg Gin Ser Leu Arg 
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275 

tct gaa cga aac 
Ser Glu Arg Asn 
290 

aac cca ccc ccc 
Asn Pro Pro Pro 
30S 

aac gtc ate tec 
1008 

Asn Val lie Ser 



ttt tec acc agt 
1056 

Phe Ser Thr Ser 
340 

acc cag act cct 
1104 

Thr Gin Thr Pro 
355 

ctt tec gaa age 
1152 

Leu Ser Glu Ser 
370 v 

agg cac age age 
1200 

Arg His Ser Ser 
385 

gga ggc cct cgt 
1248 

Gly Gly Pro Arg 



cct gat .tec tac 
1296 

Pro Asp Ser Tyr 
420 

atg acc acc ccg 
1344 

Met Thr Thr Pro 
435 

tec ccc aaa teg 
1392 

Ser Pro Lys Ser 



280 

aat atg atg aac 
Asn Met Met Asn 
295 

gag aat gtc cag 
Glu Asn Val Gin 
310 

agt gag cat att 

Ser Glu His He 
325 

cac tat act tec 
His Tyr Thr Ser 

age cac age tgg 

Ser His Ser Trp 
360 

cac tct gta ate 

His Ser Val He 
375 

cca act ggg ggc 

Pro Thr Gly Gly 
390 

gaa tgt aac age 

Glu Cys Asn Ser 
405 

cga gac tct cct 
Arg Asp Ser Pro 

get cgt atg tea 

Ala Arg Met Ser 
440 

ccc cct teg gaa 
Pro Pro Ser Glu 



att gee aat ggg 
He Ala Asn Gly 
300 

ctg gtg aat caa 
Leu Val Asn Gin 
315 

gtt gag aga gaa 

Val Glu Arg Glu 
330 

aca gee cat cac 

Thr Ala His His 
345 

age aac gga cac 
Ser Asn Gly His 

gtg atg tea. tec 

Val Met Ser Ser 
380 

cca aga gga cgt 

Pro Arg Gly Arg 
395 

ttc etc agg cat 

Phe Leu Arg His 
410 

cat agt gaa agg 

His Ser Glu Arg 
425 

cct gta gat ttc 
Pro Val Asp Phe 

atg tct cca ccc 
Met Ser Pro Pro 



285 

cct cac cat cct 912 
Pro His His Pro 

tac gta tct aaa 960 
Tyr Val Ser Lys 
320 

gca gag aca tec 

Ala Glu Thr Ser 
335 

tec act act gtc 

Ser Thr Thr Val 
350 

act gaa age ate 

Thr Glu Ser lie 
365 

gta gaa aac agt 
Val Glu Asn Ser 

ctt aat ggc aca 

Leu Asn Gly Thr 
400 

gee aga gaa acc 

Ala Arg Glu Thr 
415 

tat gtg tea gec 

Tyr Val Ser Ala 
430 

cac acg cca age 

His Thr Pro Ser . 
445 

gtg tec age atg 

Val Ser Ser Met 
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450 

acg gtg tec atg 
1440 

Thr Val Ser Met 
465 

aga cct eta ctt 
1488 

Arg Pro Leu lieu 



gac cat cac cct 
1536 

Asp His His Pro 
500 

gac agt aac age 
1584 

Asp Ser Asn Ser 
515 • 

gag tat gaa acg 
1632 

Glu Tyr Glu Thr 
530 

aaa etc gec aat 
1680 

Lys Leu Ala Asn 
545 

att get aac aga 
1728 

lie Ala Asn Arg 



aac tea gag agt 
1776 

Asn Ser Glu Ser 
580 

ttc ctg ggc ata 
1824 

Phe Leu Gly lie 
595 

gee ttc cgc ctg 
1872 

Ala Phe Arg Leu 
610 



455 

cct tec atg gcg 

Pro Ser Met Ala 
470 

etc gtg aca cca 

Leu Val Thr Pro 
485 

cag cag ttc age 

Gin Gin Phe Ser 

etc cct get age 

Leu Pro Ala Ser 
520 

acc caa gag tac 

Thr Gin Glu Tyr 
535 

age egg egg gec 

Ser Arg Arg Ala 
550 

ttg gaa gtg gac 

Leu Glu Val Asp 
565 

gaa aca gaa gat 
Glu Thr Glu Asp 

cag aac ccc ctg 

Gin Asn Pro Leu 
600 

get gac age agg 

Ala Asp Ser Arg 
615 



460 

gtc age ccc ttc 

Val Ser Pro Phe 
475 

cca agg ctg egg 

Pro Arg Leu Arg 

. 490 

tec ttc cac cac 

Ser Phe His His 
505 

ccc ttg agg ata 
Pro Leu Arg lie 

gag cca gee caa 

Glu Pro Ala Gin 
540 

aaa aga acc aag 

Lys Arg Thr Lys 
555 

age aac aca age 

Ser Asn Thr Ser 
570 

gaa aga gta ggt 

Glu Arg Val Gly 
585 

gca gec agt ctt 

Ala Ala Ser Leu 

act aac cca gca 

Thr Asn Pro Ala 
620 



atg gaa gaa gag 

Met Glu Glu Glu 
480 

gag aag aag ttt 

Glu Lys Lys Phe 
495 

aac ccc gcg cat 

Asn Pro Ala His 
510 

gtg gag gat gag 

Val Glu Asp Glu 
525 

gag cct gtt aag 
Glu Pro Val Lys 

ccc aat ggc cac 

Pro Asn Gly His 
560 

tec cag age agt 

Ser Gin Ser Ser 
575 

gaa gat acg cct 

Glu Asp Thr Pro 
590 

gag gca aca cct 

Glu Ala Thr Pro 
605 

ggc cgc ttc teg 
Gly Arg Phe Ser 
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aca cag gaa gaa ate cag gec agg ctg tct agt gta att get aac caa 
1920 

Thr Gin Glu Glu He Gin Ala Arg Leu Ser Ser Val He Ala Asn Gin 
625 630 635 640 

gac cct att get gta 
1935 

Asp Pro He Ala Val 
645 



<210> 4 
<211> 645 
<212> PRT 

<213> Homo sapiens 



<400> 4 



Met 


Ser 


Glu 


Arg 


Lys 


Glu 


Gly 


Arg 


1 








5 








Glu 


Arg 


Gly 


Ser 


Gly 


Lys 


Lys 


Pro 








20 










Pro 


Ala 


Leu 


Pro 


Pro 


Gin 


Leu 


Lys 






35 










40 


Ala 


Gly 


Ser 


Lys 


Leu 


Val 


Leu 


Arg 




50 










55 




Ser 


Leu 


Arg 


Phe 


Lys 


Trp 


Phe 


Lys 


65 










70 






Asn 


Lys 


Pro 


Gin 


Asn 


He 


Lys 


He 










85 








Leu 


Arg 


He 


Asn 


Lys 


Ala 


Ser 


Leu 








100 










Lys 


Val 


He 


Ser 


Lys 


Leu 


Gly 


Asn 






115 










120 


He 


Val 


Glu 


Ser 


Asn 


Glu 


He 


He 




130 










135 




Gly 


Ala 


Tyr 


Val 


Ser 


Ser 


Glu 


Ser 


145 










150 






Glu 


Gly 


Ala 


Asn 


Thr 


Ser 


Ser 


Ser 










165 








Ser 


His 


Leu 


Val 


Lys 


Cys 


Ala 


Glu 








180 










Gly 


Gly 


Glu 


Cys 


Phe 


Met 


Val 


Lys 






195 










200 


Leu 


Cys 


Lys 


Cys 


Pro 


Asn 


Glu 


Phe 




210 










215 




Val 


Met 


Ala 


Ser 


Phe 


Tyr 


Lys 


His 


225 










230 






Glu 


Glu 


Leu 


Tyr 


Gin 


Lys 


Arg 


Val 










245 








Ala 


Leu 


Leu 


Val 


Val 


Gly 


He 


Met 








260 










Lys 


Lys 


Gin 


Arg 


Lys 


Lys 


Leu 


His 






275 










280 



Gly 


Lys 


Gly Lys Gly Lys Lys 


Lys 




10 


15 




Glu 


Ser 


Ala Ala Gly Ser Gin 


Ser 


25 




30 




Glu 


Met 


Lys Ser Gin Glu Ser 


Ala 






45 




Cys 


Glu 


Thr Ser Ser Glu Tyr 


Ser 






60 




Asn 


Gly 


Asn Glu Leu Asn Arg 


Lys 






75 


80 


Gin 


Lys 


Lys Pro Gly Lys Ser 


Glu 




90 


95 




Ala 


Asp 


Ser Gly Glu Tyr Met 


Cys 


105 




110 




Asp 


Ser 


Ala Ser Ala Asn He 


Thr 






125 




Thr 


Gly 


Met Pro Ala Ser Thr 


Glu 






.140 




Pro 


He 


Arg lie Ser Val Ser 


Thr 






155 


160 


Thr 


Ser 


Thr Ser Thr Thr Gly 


Thr 




170 


175 




Lys 


Glu 


Lys Thr Phe Cys Val 


Asn 


185 




190 




Asp 


Leu 


Ser Asn Pro Ser Arg 


Tyr 






205 




Thr 


Gly 


Asp Arg Cys Gin Asn 


Tyr 






220 




Leu 


Gly 


He Glu Phe Met Glu- 


Ala 






235 


240 


Leu 


Thr 


He Thr Gly He Cys 


He 




250 


255 




Cys 


Val 


Va^L Ala Tyr Cys Lys 


Thr 


265 




270 




Asp 


Arg 


Leu Arg Gin Ser Leu 


Arg 



285 
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Ser 



Glu Arg Asn 
290 



Asn Met Met Asn He Ala Asn 
295 



Gly 
300 



Pro His 



His Pro 



Asn Pro Pro Pro Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys 
305 310 315 320 

Asn Val He Ser Ser Glu His He Val Glu Arg Glu Ala Glu Thr Ser 

325 330 335 

Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr Thr Val 

340 345 350 

Thr Gin Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu Ser He 

355 360 365 

Leu Ser Glu Ser His Ser Val He Val Met Ser Ser Val Glu Asn Ser 

370 375 380 

Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly Thr 
385 390 39.5 400 

Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala Arg Glu Thr 

405 410 415 

Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala 

420 425 430 

Met Thr Thr Pro Ala Arg Met Ser Pro Val Asp Phe His Thr Pro Ser 

435 440 445 

Ser Pro Lys Ser Pro Pro Ser Glu Met Ser Pro Pro Val Ser Ser Met 

450 455 460 

Thr Val Ser Met Pro Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu 
465 470 475 480 

Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg Glu Lys Lys Phe 

485 490 495 

Asp His His Pro Gin Gin Phe Ser Ser Phe His His Asn Pro Ala His 

500 505 510 

Asp Ser Asn Ser Leu Pro Ala Ser Pro Leu Arg He Val Glu Asp Glu 

515 520 525 

Glu Tyr. Glu Thr Thr Gin Glu Tyr Glu Pro Ala Gin Glu Pro Val Lys 

530 535 540 

Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn Gly His 
545 550 555 560 

lie Ala Asn Arg Leu Glu Val Asp Ser Asn Thr Ser Ser Gin Ser Ser 

565 570 575 

Asn Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro 

580 585 590 

Phe Leu Gly He Gin Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro 

595 . 600 605 

Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe Ser 

610 615 620 

Thr Gin Glu Glu He Gin Ala Arg Leu Ser Ser Val He Ala Asn Gin 
625 630 635 640 

Asp Pro He Ala Val 



645 



<210> 5 

<211> 3765 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> CDS 

<222> (1) . . . (3765) 

<400> 5 

atg gag ctg gcg gcc ttg tgc cgc tgg ggg etc etc etc gee etc ttg 48 

Met Glu Leu Ala Ala Leu Cys Arg Trp Gly Leu Leu Leu Ala Leu Leu 

15 10 15 

ccc ccc gga gcc gcg age ace caa gtg tgc ace ggc aca gac atg aag 96 
Pro Pro Gly Ala Ala Ser Thr Gin Val Cys Thr Gly Thr Asp Met Lys 
20 25 30 

ctg egg etc cct gcc agt ccc gag ace cac ctg gac atg etc cgc cac 144 
Leu Arg Leu Pro Ala Ser Pro Glu Thr His Leu Asp Met Leu Arg His 
35 40 45 

etc tac cag ggc tgc cag gtg gtg cag gga aac ctg gaa etc acc tac 192 
Leu Tyr Gin Gly Cys Gin Val Val Gin Gly Asn Leu Glu Leu Thr Tyr 
50 55 60 

ctg ccc acc aat gcc age ctg tec ttc ctg cag gat ate cag gag gtg 240 
Leu Pro Thr Asn Ala Ser Leu Ser Phe Leu Gin Asp He Gin Glu Val 
65 70 75 80 

ca 9 99 c tac 9 fc 9 etc ate get cac aac caa gtg agg cag gtc cca ctg 2 88 
Gin Gly Tyr Val Leu He Ala His Asn Gin Val Arg Gin Val Pro Leu 
85 90 95 

cag agg ctg egg att gtg cga ggc acc cag etc ttt gag gac aac tat 336 
Gin Arg Leu Arg He Val Arg Gly Thr Gin Leu Phe Glu Asp Asn Tyr 
100 105 110 

gcc ctg gcc gtg eta gac aat gga gac ccg ctg aac aat acc acc cct 384 
Ala Leu Ala Val Leu Asp Asn Gly Asp Pro Leu Asn Asn Thr Thr Pro 
115 120 ' 125 

gtc aca ggg gcc tec cca gga ggc ctg egg gag ctg cag ctt cga age 432 
Val Thr Gly Ala Ser Pro Gly Gly Leu Arg Glu Leu Gin Leu Arg Ser 
130 135 140 

etc aca gag ate ttg aaa gga ggg gtc ttg ate cag egg aac ccc cag 4 80 
Leu Thr Glu He Leu Lys Gly Gly Val Leu He Gin Arg Asn Pro Gin 
145 150 155 160 

etc tgc tac cag gac acg att ttg tgg aag gac ate ttc cac aag aac 528 
Leu Cys Tyr Gin Asp Thr He Leu Trp Lys Asp lie Phe His Lys Asn 
165 170 175 

aac cag ctg get etc aca ctg ata gac acc aac cgc tct egg gcc tgc , 576 
Asn Gin Leu Ala Leu Thr Leu He Asp Thr Asn Arg Ser Arg Ala Cys 
180 185 190 
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cac ccc tgt tct ccg atg tgt aag ggc tec cgc tgc tgg gga gag agt 624 

His Pro Cys Ser Pro Met Cys Lys Gly Ser Arg Cys Trp Gly Glu Ser 

195 200 205 

tct gag gat tgt cag age ctg acg cgc act gtc tgt gec ggt ggc tgt 672 

Ser Glu Asp Cys Gin Ser Leu Thr Arg Thr Val Cys Ala Gly Gly Cys 

210 215 220 

gec cgc tgc aag ggg cca ctg ccc act gac tgc tgc cat gag cag tgt 720 

Ala Arg Cys Lys Gly Pro Leu Pro Thr Asp Cys Cys His Glu Gin Cys 

225 230 235 240 

get gec ggc tgc acg ggc ccc aag cac tct gac tgc ctg gec tgc etc 768 

Ala Ala Gly Cys Thr Gly Pro Lys His Ser Asp Cys Leu Ala Cys Leu 

245 250 255 

cac ttc aac cac agt ggc ate tgt gag ctg cac tgc cca gee ctg gtc 816 

His Phe Asn His Ser Gly lie Cys Glu Leu His Cys Pro Ala Leu Val 
260 265 270 

ace tac aac aca gac acg ttt gag tec atg ccc aat ccc gag ggc egg 864 

Thr Tyr Asn Thr Asp Thr Phe Glu Ser Met Pro Asn Pro Glu Gly Arg 

275 280 285 

tat aca ttc ggc gee age tgt gtg act gee tgt ccc tac aac tac ctt 912 

Tyr Thr Phe Gly Ala Ser Cys Val Thr Ala Cys Pro Tyr Asn Tyr Leu 

290 295 300 

tct acg gac gtg gga tec tgc ace etc gtc tgc ccc ctg cac aac caa 960 

Ser Thr Asp Val Gly Ser Cys Thr Leu Val Cys Pro Leu His Asn Gin 

305 310 315 320 

gag gtg aca gca gag gat gga aca cag egg tgt gag aag tgc. age aag 
1008 

Glu Val Thr Ala Glu Asp Gly Thr Gin Arg Cys Glu Lys Cys Ser Lys 

325 330 335 

ccc tgt gee cga gtg tgc tat ggt ctg ggc atg gag cac ttg cga gag 
1056 

Pro Cys Ala Arg Val Cys Tyr Gly Leu Gly Met Glu His Leu Arg Glu 
340 345 350 

gtg agg gca gtt ace agt gee aat ate cag gag ttt get ggc tgc aag 
1104 

Val Arg Ala Val Thr Ser Ala Asn lie Gin Glu Phe Ala Gly Cys Lys 

355 360 365 

aag ate ttt ggg age ctg gca ttt ctg ccg gag age ttt gat ggg gac 
1152 

Lys lie Phe Gly Ser Leu Ala Phe Leu Pro Glu Ser Phe Asp Gly Asp 

370 375 380 
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cca gcc tec aac 
1200 

Pro Ala Ser Asn 
385 

gag act ctg gaa 
1248 

Glu Thr Leu Glu 



gac age ctg cct 
12 96 

Asp Ser Leu Pro 
420 

gga cga att ctg 
1344 

Gly Arg He Leu 
435 

ggc ate age tgg 
1392 

Gly He Ser Trp 
450 

ctg gcc etc ate 
1440 

Leu Ala Leu He 
465 

ccc tgg gac cag 
1488 

Pro Trp Asp Gin 



gcc aac egg cca 
1536 . 

Ala Asn Arg Pro 
500 

cag ctg tgc gcc 
1584 

Gin Leu Cys Ala 
515 

gtc aac tgc age 
1632 

Val Asn Cys Ser 
530 

cga gta ctg cag 
1680 

Arg Val Leu Gin 



act gcc ccg etc 

Thr Ala Pro Leu 
390 

gag ate aca ggt 

Glu He Thr Gly 
405 

gac etc age gtc 
Asp Leu Ser Val 

cac aat ggc gcc 

His Asn Gly Ala 
440 

ctg ggg ctg cgc 

Leu Gly Leu Arg 
455 

cac ca,t aac ace 

His His Asn Thr 
470 

etc ttt egg aac 

Leu Phe Arg Asn 
485 

gag gac gag tgt 
Glu Asp Glu Cys 

c 9 a 999 cac tgc 

Arg Gly His Cys 
520 

cag ttc ctt egg 

Gin Phe Leu Arg 
535 

ggg etc ccc agg 
Gly Leu Pro Arg 



cag cca gag cag 

Gin Pro Glu Gin 
395 

tac eta tac ate 

Tyr Leu Tyr He 
410 

ttc cag aac ctg 

Phe Gin Asn Leu 
425 

tac teg ctg ace 
Tyr Ser Leu Thr 

tea ctg agg gaa 

Ser Leu Arg Glu 
460 

cac etc tgc ttc 

His Leu Cys Phe 
475 

ccg cac caa get 

Pro His Gin Ala 
490 

gtg ggc gag ggc 

Val Gly Glu Gly 
505 

tgg ggt cca ggg 
Trp Gly Pro Gly 

ggc cag gag tgc 

Gly Gin Glu Cys 
540 

gag tat gtg aat 
Glu Tyr Val Asn 



etc caa gtg ttt 

Leu Gin Val Phe 
400 

tea gca tgg ccg 

Ser Ala Trp Pro 
415 

caa gta ate egg 

Gin Val. He Arg 
430 

ctg caa ggg ctg 

Leu Gin Gly Leu 
445 

ctg ggc agt gga 
Leu Gly Ser Gly 

gtg cac acg gtg 

Val His Thr Val 
480 

ctg etc cac act 

Leu Leu His Thr 
495 

ctg gcc tgc cac 

Leu Ala Cys His 
510 

ccc ace cag tgt 

Pro Thr Gin Cys 
525 

gtg gag gaa tgc 
Val Glu Glu Cys 

gcc agg cac tgt 
Ala Arg His Cys 
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545 550 555 560 

ttg ccg tgc cac cct gag tgt cag ccc cag aat ggc tea gtg acc tgt 
1728 

Leu Pro Cys His Pro Glu Cys Gin Pro Gin Asn Gly Ser Val Thr Cys 
565 570 575 

ttt gga ccg gag get gac cag tgt gtg gec tgt gee cac tat aag gac 
1776 

Phe Gly Pro Glu Ala Asp Gin Cys Val Ala Cys Ala His Tyr Lys Asp 
580 585 590 

cct ccc ttc tgc gtg gec cgc tgc ccc age ggt gtg aaa cct gac etc 
1824 

Pro Pro Phe Cys Val Ala Arg Cys Pro Ser Gly Val Lys Pro Asp Leu 
595 600 605 

tec tac atg ccc ate tgg- aag ttt cca gat gag gag ggc gca tgc cag 
1872 

Ser Tyr Met Pro lie Trp Lys Phe Pro Asp Glu Glu Gly Ala Cys Gin 

610 615 620 

cct tgc ccc ate aac tgc acc cac tec tgt gtg gac ctg gat gac aag 
1920 

Pro Cys Pro lie Asn Cys Thr His Ser Cys Val Asp Leu Asp Asp Lys 

625 630 635 ■ 640 

ggc tgc ccc gee gag cag aga gec age cct ctg acg tec ate gtc tct 
1968 

Gly Cys Pro Ala Glu Gin Arg Ala Ser Pro Leu Thr Ser lie Val Ser 
645 650 655 

9 C 9 gtg gtt ggc att ctg ctg gtc gtg gtc ttg ggg gtg gtc ttt ggg 
2016 

Ala Val Val Gly lie Leu Leu Val Val Val Leu Gly Val Val Phe Gly 
660 665 670 

ate etc ate aag cga egg cag cag aag ate egg aag tac acg atg egg 
2064 

He Leu He Lys Arg Arg Gin Gin Lys He Arg Lys Tyr Thr Met Arg 
675 680 685 

aga ctg ctg cag gaa acg gag ctg gtg gag ccg ctg aca cct age gga 
2112 

Arg Leu Leu Gin Glu Thr Glu Leu Val Glu Pro Leu Thr Pro Ser Gly 
690 695 700 

gcg atg ccc aac cag gcg cag atg egg ate ctg aaa gag acg gag ctg 
2160 

Ala Met Pro Asn Gin Ala Gin Met Arg He Leu Lys Glu Thr Glu Leu 
705 710 715 720 
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agg aag gtg aag gtg ctt gga tct ggc get ttt ggc aca gtc tac aag 
2208 

Arg Lys Val Lys Val Leu Gly Ser Gly Ala Phe Gly Thr Val Tyr Lys 
725 730 735 

ggc ate tgg ate cct gat ggg gag aat gtg aaa att cca gtg gec ate 
2256 

Gly lie Trp lie Pro Asp Gly Glu Asn Val Lys He Pro Val Ala He 
740 745 750 

aaa gtg ttg agg gaa aac aca tec ccc aaa gee aac aaa gaa ate tta 
2304 . 

Lys Val Leu Arg Glu Asn Thr Ser Pro Lys Ala Asn Lys Glu He Leu 
755 760 765 

gac gaa gca tac gtg atg get ggt gtg ggc tec cca tat gtc tec cgc 
2352 

Asp Glu Ala Tyr Val Met Ala Gly Val Gly Ser Pro Tyr Val Ser Arg 
770 775 780 

ctt ctg ggc ate tgc ctg aca tec acg gtg cag ctg gtg aca cag ctt 
2400 

Leu Leu Gly He Cys Leu Thr Ser Thr Val Gin Leu Val Thr Gin Leu 

785 790 795 800 

atg ccc tat ggc tgc etc tta gac cat gtc egg gaa aac cgc gga cgc 
244 8 

Met Pro Tyr Gly Cys Leu Leu Asp His Val Arg Glu Asn Arg Gly Arg 
805 810 815 

ctg ggc tec cag gac ctg ctg aac tgg tgt atg cag att gee aag ggg 
2496 

Leu Gly Ser Gin Asp Leu Leu Asn Trp Cys Met Gin He Ala Lys Gly 
820 825 830 

atg age tac ctg gag gat gtg egg etc gta cac agg gac ttg gee get 
2544 

Met Ser Tyr Leu Glu Asp Val Arg Leu Val His Arg Asp Leu Ala Ala 
835 840 845 

egg aac gtg ctg gtc aag agt ccc aac cat gtc aaa att aca gac ttc 
2592 

Arg Asn Val Leu Val Lys Ser Pro Asn His Val Lys He Thr Asp Phe 

850 855 860 

ggg ctg get egg ctg ctg gac att gac gag aca gag tac cat gca gat 
2640 

Gly Leu Ala Arg Leu Leu Asp He Asp Glu Thr Glu Tyr His Ala Asp 

865 870 875 880 

999 99C aag gtg ccc ate aag tgg atg gcg ctg gag tec att etc cgc 
2688 

Gly Gly Lys Val Pro He Lys Trp Met Ala Leu Glu Ser He Leu Arg 
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885 890 895 

egg egg ttc acc cac cag agt gat gtg tgg agt tat ggt gtg act gtg 
2736 

Arg Arg Phe Thr His Gin Ser Asp Val Trp Ser Tyr Gly Val Thr Val 
900 905 910 

tgg gag ctg atg act ttt ggg gec aaa cct tac gat ggg ate cca gec 
2784 

Trp Glu Leu Met Thr Phe Gly Ala Lys Pro Tyr Asp Gly lie Pro Ala 
915 920 925 

egg gag ate cct gac ctg ctg gaa aag ggg gag egg ctg ccc cag ccc 
2832 

Arg Glu lie Pro Asp Leu Leu Glu Lys Gly Glu Arg Leu Pro Gin Pro 
930 935 940 

ccc ate tgc acc att gat gtc tac atg ate atg gtc aaa tgt tgg atg 
2880 

Pro He Cys Thr He Asp Val Tyr Met He Met Val Lys Cys Trp Met 

945 950 955 960 

att gac tct gaa tgt egg cca aga ttc egg gag ttg gtg tct gaa ttc 
2928 

He Asp Ser Glu Cys Arg Pro Arg Phe Arg Glu Leu Val Ser Glu Phe 
965 970 975 

tec cgc atg gec agg gac ccc cag cgc ttt gtg gtc ate cag aat gag 
2976 

Ser Arg Met Ala Arg Asp Pro Gin Arg Phe Val Val He Gin Asn Glu 
980 985 990 

gac ttg ggc cca gee agt ccc ttg gac age acc ttc tac cgc tea ctg 
3024 

Asp Leu Gly Pro Ala Ser Pro Leu Asp Ser Thr Phe Tyr Arg Ser Leu 
995 1000 1005 

ctg gag gac gat gac atg ggg gac ctg gtg gat get gag gag tat ctg 
3072 

Leu Glu Asp Asp Asp Met Gly Asp Leu Val Asp Ala Glu Glu Tyr Leu 
1010 1015 1020 

gta ccc cag cag ggc ttc ttc tgt cca gac cct gee ccg ggc get ggg 
3120 

Val Pro Gin Gin Gly Phe Phe Cys Pro Asp Pro Ala Pro Gly Ala Gly 
1025 .1030 1035 1040 

ggc atg gtc cac cac agg cac cgc age tea tct acc agg agt ggc ggt 
3168 

Gly Met Val His His Arg His Arg Ser Ser Ser Thr Arg Ser Gly Gly 

1045 1050 1055 
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999 9^c ctg aca eta ggg ctg gag ccc tct gaa gag gag gec ccc agg 
3216 ' 

Gly Asp Leu Thr Leu Gly Leu Glu Pro Ser Glu Glu Glu Ala Pro Arg 
1060 1065 1070 

tct cca ctg gca ccc tec gaa ggg get ggc tec gat gta ttt gat ggt 
3264 

Ser Pro Leu Ala Pro Ser Glu Gly Ala Gly Ser Asp Val Phe Asp Gly 

1075 1080 1085 

gac ctg gga atg ggg gca gee aag ggg ctg caa age etc ccc aca cat 
3312 

Asp Leu Gly Met Gly Ala Ala Lys Gly Leu Gin Ser Leu Pro Thr His 
1090 1095 1100 

gac ccc age cct eta cag egg tac agt gag gac ccc aca gta ccc ctg 
3360 

Asp Pro Ser Pro Leu Gin Arg Tyr Ser Glu Asp Pro Thr Val Pro Leu 
1105 . 1110 1115 1120 

ccc tct gag act gat ggc tac gtt gec ccc ctg ace tgc age ccc cag 
3408 

Pro Ser Glu Thr Asp Gly Tyr Val Ala Pro Leu Thr Cys Ser Pro Gin 
1125 1130 1135 

cct gaa tat gtg aac cag cca gat gtt egg ccc cag ccc cct teg ccc 
3456 

Pro Glu Tyr Val Asn Gin Pro Asp Val Arg Pro Gin Pro Pro Ser Pro 
1140 1145 1150 

cga gag ggc cct ctg cct get gec cga cct get ggt gee act ctg gaa 
3504 

Arg Glu Gly Pro Leu Pro Ala Ala Arg Pro Ala Gly Ala Thr Leu Glu 
1155 1160 1165 

agg gee aag act etc tec cca ggg aag aat ggg gtc gtc aaa gac gtt 
3552 

Arg Ala Lys Thr Leu Ser Pro Gly Lys Asn Gly Val Val Lys Asp Val 
1170 1175 1180 

ttt gec ttt ggg ggt gee gtg gag aac ccc gag tac ttg aca ccc cag 
3600 

Phe Ala Phe Gly Gly Ala Val Glu Asn Pro Glu Tyr Leu Thr Pro Gin 
1185 1190 1195 1200 

99£i gga get gee cct cag ccc cac cct cct cct gee ttc age cca gec 
364 8 

Gly Gly Ala Ala Pro Gin Pro His Pro Pro Pro Ala Phe Ser Pro Ala 
1205 1210 1215 

ttc gac aac etc tat tac tgg gac cag gac cca cca gag egg ggg get 
3696 

Phe Asp Asn Leu Tyr Tyr Trp Asp Gin Asp Pro Pro Glu Arg Gly Ala 
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1220 1225 1230 

cca ccc age acc ttc aaa ggg aca cct acg gca gag aac cca gag tac 
3744 

Pro Pro Ser Thr Phe Lys Gly Thr Pro Thr Ala Glu Asn Pro Glu Tyr 
1235 1240 1245 

ctg ggt ctg gac gtg cca gtg 
3765 

Leu Gly Leu Asp Val Pro Val 
1250 1255 



<210> 6 
<211> 1255 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Met Glu Leu Ala Ala Leu Cys Arg Trp Gly Leu Leu Leu Ala Leu Leu 

1 5 10 15 

Pro Pro Gly Ala Ala Ser Thr Gin Val Cys Thr Gly Thr Asp Met Lys 

20 25 30 

Leu Arg Leu Pro Ala Ser Pro Glu Thr His Leu Asp Met Leu Arg His 

35 40 45 

Leu Tyr Gin Gly Cys Gin Val Val Gin Gly. Asn Leu Glu Leu Thr Tyr 

50 55 60 

Leu Pro Thr Asn Ala Ser Leu Ser Phe Leu Gin Asp lie Gin Glu Val 
65 70 75 80 

Gin Gly Tyr Val Leu lie Ala His Asn Gin Val Arg Gin Val Pro Leu 

85 90 95 

Gin Arg Leu Arg lie Val Arg Gly Thr Gin Leu Phe Glu Asp Asn Tyr 

100 105 110 

Ala Leu Ala Val Leu Asp Asn Gly Asp Pro Leu Asn Asn Thr Thr Pro 

115 120 125 

Val Thr Gly Ala Ser Pro Gly Gly Leu Arg Glu Leu Gin Leu Arg Ser 

130 135 140 

Leu Thr Glu lie Leu Lys Gly Gly Val Leu lie Gin Arg Asn Pro Gin 
145 150 155 160 

Leu Cys Tyr Gin Asp Thr lie Leu Trp Lys Asp He Phe His Lys Asn 

165 170 175 

Asn Gin Leu Ala Leu Thr Leu He Asp Thr Asn Arg Ser Arg Ala Cys 

180 185 190 

His Pro Cys Ser Pro Met Cys Lys Gly Ser Arg Cys Trp Gly Glu Ser 

195 200 205 

Ser Glu Asp Cys Gin Ser Leu Thr Arg Thr Val Cys Ala Gly Gly Cys 

210 215 220 

Ala Arg Cys Lys Gly Pro Leu Pro Thr Asp Cys Cys His Glu Gin Cys 
225 230 235 . 240 

Ala Ala Gly Cys Thr Gly Pro Lys His Ser Asp Cys Leu Ala Cys Leu 

245 250 255 

His Phe Asn His Ser Gly lie Cys Glu Leu His Cys Pro Ala Leu Val 
260 265 270 
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Thr 


Tyr 


Asn 


Thr 


Asp 


Thr 


Phe 


Glu 






275 










280 


Tyr 


Thr 


Phe Gly 


Ala 


Ser 


Cys 


Val 




290 










295 




Ser 


Thr 


Asp 


Val 


Gly. 


Ser 


Cys 


Thr 


305 










310 






Glu 


Val 


Thr 


Ala 


Glu 


Asp 


Gly 


Thr 










325 








Pro 


Cys 


Ala Arg 


Val 


Cys 


Tyr 


Gly 








340 










Val 


Arg 


Ala 


Val 


Thr 


Ser 


Ala 


Asn 






355 










360 


Lys 


He 


Phe Gly 


Ser 


Leu 


Ala 


Phe 




370 










375 




Pro 


Ala 


Ser 


Asn 


Thr 


Ala 


Pro 


Leu 


385 










390 






Glu 


Thr 


Leu 


Glu 


Glu 


He 


Thr 


Gly 










405 








Asp 


Ser 


Leu 


Pro 


Asp 


Leu 


Ser 


Val 








420 










Gly 


Arg 


He 


Leu 


His 


Asn 


Gly 


Ala 






435 










440 


Gly 


He 


Ser 


Trp 


Leu 


Gly 


Leu 


Arg 




450 










455 




Leu 


Ala 


Leu 


He 


His 


His 


Asn 


Thr 


465 










470 






Pro 


Trp 


Asp 


Gin 


Leu 


Phe 


Arg 


Asn 










485 








Ala 


Asn 


Arg 


Pro 


Glu 


Asp 


Glu 


Cys 








500 










Gin 


Leu 


Cys 


Ala 


Arg 


Gly 


His 


Cys 






515 










520 


Val 


Asn 


Cys 


Ser 


Gin 


Phe 


Leu 


Arg 




530 










535 




Arg 


Val 


Leu 


Gin 


Gly 


Leu 


Pro 


Arg 


545 










550 






Leu 


Pro 


Cys 


His 


Pro 


Glu 


Cys 


Gin 










565 








Phe 


Gly 


Pro 


Glu 


Ala 


Asp 


Gin 


Cys 








580 










Pro 


Pro 


Phe 


Cys 


Val 


Ala 


Arg 


Cys 






595 










600 


Ser 


Tyr 


Met 


Pro 


He 


Trp 


Lys 


Phe 




610 










615 




Pro 


Cys 


Pro 


He 


Asn 


Cys 


Thr 


His 


625 










630 






Gly 


Cys 


Pro 


Ala 


Glu 


Gin 


Arg 


Ala 










645 








Ala 


Val 


Val Gly 


He 


Leu 


Leu 


Val 








660 










lie 


Leu 


He 


Lys 


Arg 


Arg 


Gin 


Gin 






675 










680 


Arg 


Leu 


Leu 


Gin 


Glu 


Thr 


Glu 


Leu 



Ser Met 


Pro 


Asn 


Pro 


Glu 


Gly 


Arg 








285 








Thr Ala 


Cys 


Pro 


Tyr 


Asn 


Tyr 


Leu 






300 










Leu Val 


Cys 


Pro 


Leu 


His 


Asn 


Gin 




315 










320 


Gin Arg 


Cys 


Glu Lys 


Cys 


Ser 


Lys 


330 










335 




Leu Gly 


Met 


Glu 


His 


Leu 


Arg 


Glu 


345 








350 






He Gin 


Glu 


Phe 


Ala 


Gly 


Cys 


Lys 








365 








Leu Pro 


Glu 


Ser 


Phe 


Asp 


Gly 


Asp 






380 










Gin Pro 


Glu 


Gin 


Leu 


Gin 


Val 


Phe 




395 










400 


Tyr Leu 


Tyr 


He 


Ser 


Ala 


Trp 


Pro 


410 










415 




Phe Gin 


Asn 


Leu 


Gin 


Val 


He 


Arg 


425 








430 






Tyr Ser 


Leu 


Thr 


Leu 


Gin 


Gly 


Leu 








445 








Ser Leu 


Arg 


Glu 


Leu 


Gly 


Ser 


Gly 






460 










His Leu 


Cys 


Phe 


Val 


His 


Thr 


Val 




475 










480 


Pro His 


Gin 


Ala 


Leu 


Leu 


His 


Thr 


490 










495 




Val Gly 


Glu 


Gly 


Leu 


Ala 


Cys 


His 


505 








510 






Trp Gly 


Pro 


Gly Pro 


Thr 


Gin 


Cys 








525 








Gly Gin 


Glu 


Cys 


Val 


Glu 


Glu 


Cys 






540 










Glu Tyr 


Val 


Asn 


Ala 


Arg 


His 


Cys 




555 










560 


Pro Gin 


Asn 


Gly Ser 


Val 


Thr 


Cys 


570 










575 




Val Ala 


Cys 


Ala His 


Tyr 


Lys 


Asp 


585 








590 






Pro Ser 


Gly 


Val 


Lys 


Pro 


Asp 


Leu 








605 








Pro Asp 


Glu 


Glu Gly 


Ala 


Cys 


Gin 






62 0 










Ser Cys 


Val 


Asp Leu 


Asp 


Asp 


Lys 




635 










640 


Ser Pro 


Leu 


Thr 


Ser 


He 


Val 


Ser 


650 










655 




Val Val 


Leu 


Gly Val 


Val 


Phe 


Gly 


665 








670 






Lys He 


Arg 


Lys 


Tyr 


Thr 


Met 


Arg 








685 








Val Glu 


Pro 


Leu 


'Thr 


Pro 


Ser 


Gly 
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690 










695 










700 










Ala 


Met 


Pro 


Asn 


Gin 


Ala 


Gin 


Met 


Arg 


He 


Leu 


Lys 


Glu 


Thr 


Glu 


Leu 


705 










710 










715 










720 


Arg 


Lys 


Val 


Lys 


Val 


Leu 


Gly 


Ser 


Gly 


Ala 


Phe 


Gly 


Thr 


Val 


Tyr 


Lys 










725 










730 










735 




Gly 


lie 


Trp 


He 


Pro 


Asp 


Gly 


Glu 


Asn 


Val 


Lys 


He 


Pro 


val 


Ala 


He 








740 










745 










750 






Lys 


Val 


Leu 


Arg 


Glu 


Asn 


Thr 


Ser 


Pro 


Lys 


Ala 


Asn 


Lys 


Glu 


He 


Leu 






755 










760 










765 








Asp 


Glu 


Ala 


Tyr 


Val 


Met 


Ala 


Gly 


Val 


Gly 


Ser 


Pro 


Tyr 


Val 


Ser 


Arg 




770 










775 










780 










Leu 


Leu 


Gly 


He 


Cys 


Leu 


Thr 


Ser 


Thr 


Val 


Gin 


Leu 


Val 


Thr 


Gin 


Leu 


785 










790 










795 










800 


Met 


Pro 


Tyr 


Gly 


Cys 


Leu 


Leu 


Asp 


His 


Val 


Arg 


Glu 


Asn 


Arg 


Gly 


Arg 










805 










810 










815 




Leu 


Gly 


Ser 


Gin 


Asp 


Leu 


Leu 


Asn 


Trp 


Cys 


Met 


Gin 


He 


Ala 


Lys 


Gly 








820 










825 










830 






Met 


Ser 


Tyr 


Leu 


Glu 


Asp 


Val 


Arg 


Leu 


Val 


His 


Arg 


Asp 


Leu 


Ala 


Ala 






835 










840 










845 








Arg 


Asn 


Val 


Leu 


Val 


Lys 


Ser 


Pro 


Asn 


His 


Val 


Lys 


He 


Thr 


Asp 


Phe 




850 










855 










860 










Gly 


Leu 


Ala 


Arg 


Leu 


Leu 


Asp 


He 


Asp 


Glu 


Thr 


Glu 


Tyr 


His 


Ala 


Asp 


865 










870 










875 










880 


Gly 


Gly 


Lys 


Val 


Pro 


He 


Lys 


Trp 


Met 


Ala 


Leu 


Glu 


Ser 


He 


Leu 


Arg 










885 










890 










895 




Arg 


Arg 


Phe 


Thr 


His 


Gin 


Ser 


Asp 


Val 


Trp 


Ser 


Tyr 


Gly 


Val 


Thr 


Val 








900 










905 










910 






Trp 


Glu 


Leu 


Met 


Thr 


Phe 


Gly 


Ala 


Lys 


Pro 


Tyr 


Asp 


Gly 


He 


Pro 


Ala 






915 










920 










925 








Arg 


Glu 


He 


Pro 


Asp 


Leu 


Leu 


Glu 


Lys 


Gly 


Glu 


Arg 


Leu 


Pro 


Gin 


Pro 




930 










935 










940 










Pro 


He 


Cys 


Thr 


He 


Asp 


Val 


Tyr 


Met 


He 


Met 


Val 


Lys 


Cys 


Trp 


Met 


945 










950 










955 










960 


lie 


Asp 


Ser 


Glu 


Cys 


Arg 


Pro 


Arg 


Phe 


Arg 


Glu 


Leu 


Val 


Ser 


Glu 


Phe 










965 










970 










975 




Ser 


Arg 


Met 


Ala 


Arg 


Asp 


Pro 


Gin 


Arg 


Phe 


Val 


Val 


He 


Gin 


Asn 


Glu 








980 










985 










990 






Asp 


Leu 


Gly 


Pro 


Ala 


Ser 


Pro 


Leu 


Asp 


Ser 


Thr 


Phe 


Tyr 


Arg 


Ser 


Leu 






995 










1000 








1005 






Leu 


Glu 


Asp 


Asp 


Asp 


Met 


Gly 


Asp Leu 


Val 


Asp 


Ala 


Glu 


Glu 


Tyr 


Leu 




1010 








1015 








1020 








Val 


Pro 


Gin 


Gin 


Gly Phe 


Phe 


Cys 


Pro 


Asp 


Pro 


Ala Pro Gly 


Ala 


Gly 



1025 1030 1035 1040 

Gly Met Val His His Arg His Arg Ser Ser Ser Thr Arg Ser Gly Gly 

1045 1050 1055 

Gly Asp Leu Thr Leu Gly Leu Glu Pro Ser Glu Glu Glu Ala Pro Arg 

1060 1065 1070 

Ser Pro Leu Ala Pro Ser Glu Gly Ala Gly Ser Asp Val Phe Asp Gly 

1075 1080 1085 

Asp Leu Gly Met Gly Ala Ala Lys Gly Leu Gin Ser Leu Pro Thr His 

1090 1095 1100 

Asp Pro Ser Pro Leu Gin Arg Tyr Ser Glu Asp Pro Thr Val Pro Leu 
1105 1110 1115 1120 
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Pro Ser Glu Thr Asp Gly Tyr Val Ala Pro Leu Thr Cys Ser Pro Gin 

1125 1130 1135 

Pro Glu Tyr Val Asn Gin Pro Asp Val Arg Pro Gin Pro Pro Ser Pro 

1140 1145 1150 

Arg Glu Gly Pro Leu Pro Ala Ala Arg Pro Ala Gly Ala Thr Leu Glu 

1155 1160 1165 

Arg Ala Lys Thr Leu Ser Pro Gly Lys Asn Gly Val Val Lys Asp Val 

1170 1175 1180 

Phe Ala Phe Gly Gly Ala Val Glu Asn Pro Glu Tyr Leu Thr Pro Gin 
1185 1190 . 1195 1200 

Gly Gly Ala Ala Pro Gin Pro His Pro Pro Pro Ala Phe Ser Pro Ala 

1205 1210 1215 

Phe Asp Asn Leu Tyr Tyr Trp Asp Gin Asp Pro Pro Glu Arg Gly Ala 

1220 1225 1230 

Pro Pro Ser Thr Phe Lys Gly Thr Pro Thr Ala Glu Asn Pro Glu Tyr 

1235 1240 1245 

Leu Gly Leu Asp Val Pro Val 
1250 1255 



<210> 7 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 
<400> 7 

ctagtctcta gatccgaggt gggcaactct 3 0 

<210> 8 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 
<400> 8 

taccgatcta gatttcggac agagaccccc 30 

<210> 9 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 9 

ctactctcta gatccgaggg caactct 27 
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<210> 10 
<211> 30 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> primer 



<400> 10 

taccgatcta gatttcggac agagaccccc 



30 



<210> 11 
<211> 21 
<212> DNA 



<213> Artificial Sequence 



<220> . 
<223> primer 



<400> 11 

gcgatgctgc gaaccaatac c 



21 



<210> 12 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 12 

ggtattggtt cgcagcatcg c 21 



<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 13 

cgatgctgag gcccaatacc ag 22 

<210> 14 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<210> 13 



<400> 14 
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ctggtattgg cgctcagcat eg 



22 



<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 15 

gacactgtac gcgctctacg ag 22 

<210> 16 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<210> 17 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 17 

gggaaccttg cgattgtgct c 21 

<210> 18 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<210> 19 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 16 

ctegtagage eggtacagtg tc 



22 



<400> 18 

gagcacaatc gcaaggttcc c 



21 
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<400> 19 

cttgagattg cgctcacgg 



19 



<210> 20 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 20 

cgtgagcgca atctcaaggt 20 

<210> 21 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<210> 22 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 22 

gtagagtaga gcgttcattc atgg 24 

<210> 23 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 21 

ccatgaatga agcctctact ctac 



24 



<400> 23 

gacccaggcc tacgatggga a 



21 



<210> 24 
<211> 22 
<212> DNA 



<213> Artificial Sequence 
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<220> . 
<223> primer 

<400> 24 

cttcccatcg taggcctggg tc 
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